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TITLE OF THE INVENTION 
Methods and Agents for Screening for Compounds Capable of Modulating VEGF Expression 

BACKGROUND OF THE INVENTION 
Aberrant angiogenesis plays a role in the pathogenesis of numerous diseases, 
including malignant, ischemic, inflammatory and immune disorders (reviewed in Matter, 
Drug Discovery Today, (5:1005-1024 (2001); Yancopoulos et al, Nature, 407:242-248(2000); 
Carmeliet, Nat. Med., 9(6):653-660 (2003); Ferrara, Semin. Oncol, 29(6 Suppl 7(5^:10-14 
(2002)). Vascular Endothelial Growth Factor (VEGF), an angiogenesis regulator, plays a 
central role in angiogenesis. In particular, VEGF is an important factor in the pathogenesis of 
cancers, diabetic retinopathy (DR), and exudative macular degeneration (reviewed in Tandle 
et al, Clin. Adv. in Hemat. and Oncol, 7(7j:41-48 (2003); Ferrara et al, Nat. Med., 
5(727:1359-1364 (1999); Matter, supra; Carmeliet supra; Kerbel et al,Nat. Rev. Cancer, 
2(1 0):727-739 (2002); Witmer et al, Prog. Retin. Eye Res., 22(l):\-29 (2003); Clark et al., 
Nat. Rev. Drug Discover, 2:448-459 (2003); Ferrara (2002), supra; Thomas, J. Biol Chem., 
271:603-606 (1996); Gerber et al, Development, 726:1149-1159 (1999); which are hereby 
incorporated by reference), the last two of which are leading causes of blindness in the United 
States. 

The expression of VEGF is regulated by a number of factors and agents including 
cytokines, growth factors, steroid hormones and chemicals, and mutations that modulate the 
activity of oncogenes such as ras or the tumor suppressor gene VHL (Maxwell et al, Nature, 
3PP.-271-275 (1999); Rak et al, Cancer Res., 60:490-498 (2000)). In part, VEGF expression 
is regulated after transcription by sequences in both the 5'- and 3 '-untranslated regions 
(UTRs) of its mRNA (Ikeda et al, J. Biol Chem., 270:19761-19766 (1995); Stein et al, Mol 
Cell. Biol, 75:3112-3119 (1998); Levy et al. J. Biol Chem., 277:2746-2753 (1996); Huez et 
al, Mol Cell Biol, 75:6178-6190 (1998); Akiri et al, Oncogene, 77:227-236 (1998)). The 
VEGF 5' UTR is unusually long and GC rich, and it contains an internal ribosomal entry site 
(IRES) that is reported to mediate a unique, cap-independent mode of translation initiation. 
The VEGF 3' UTR harbors multiple AU-rich stability determinants that have been shown to 
be associated with VEGF mRNA turnover rates. 
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Initiation of translation of the VEGF mRNA is reported to be unique under hypoxic 
conditions in that it is mediated via an internal ribosome entry site (IRES) within the VEGF 
5' UTR (Stein et al, supra; Levy et al, supra; Huez et al, supra; Akiri et al, supra). Under 
hypoxic conditions, cap-dependent translation is dramatically impaired and the translation of 
the VEGF mRNA occurs through its cap-independent IRES. Initiation of translation of most 
eukaryotic mRNA is cap-dependent. IRES-mediated translation initiation becomes 
predominant when components of the translation initiation complex become rate-limiting, 
e.g., during hypoxia (Mitchell et al, Mol Cell, U(3):151-lll (2003)). 

Several investigators have used in vitro and bicistronic strategies, often in conjunction 
with deletion mutants, to study regulation of VEGF. Prats and colleagues reported the 
occurrence of cap-independent translation of human VEGF by virtue of an IRES. From these 
studies, they postulated the presence of two IRESs, a first IRES (IRES A) located within 300 
nucleotides of the initiation codon and a second IRES (IRES B) located in the upstream half 
of the 5' -UTR. Huez et al, supra. In Stein et al, supra., deletion mutants in dicistronic and 
monocistronic constructs were used to identify sequences of the VEGF 5' UTR required for 
maximal IRES activity. Keshet and colleagues reported an increase in IRES activity from a 
163-nucleotide sequence derived from a VEGF 5' UTR, which is possibly an artifact from 
RT-PCR amplification, relative to the entire full-length VEGF 5' UTR. Stein et al, supra. 
Goodall and colleagues reported a deletion analysis of IRES residues toward the 3' end of the 
mouse VEGF 5' UTR, and speculated that, for optimal IRES activity, the upstream half of the 
VEGF 5' UTR is necessary. Miller et al, supra. 

The present invention provides, for the first time, a negative regulator of post- 
transcriptional regulation (NeRP) located in the 5' UTR of VEGF. Removal of a NeRP from 
the VEGF 5' UTR results in increased translation of an operably linked gene dependent on 
the presence of a PTCRE of the present invention. 

SUMMARY OF THE INVENTION 

The present invention includes a nucleic acid construct comprising a nucleic acid 
sequence encoding a reporter polypeptide, where the nucleic acid sequence encoding a 
reporter polypeptide is operably linked to a NeRP, the NeRP is operably linked to a PTCRE, 
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the PTCRE is not SEQ ID NO: 3, and expression of the reporter polypeptide is capable of 
being modulated relative to in an absence of the NeRP. 

The present invention also includes a nucleic acid molecule comprising a nucleic acid 
sequence encoding a reporter polypeptide and a VEGF 5' UTR nucleic acid sequence in an 
absence of SEQ ID NO: 4. 

The present invention also includes a nucleic acid molecule comprising a nucleic acid 
sequence encoding a reporter polypeptide operably linked to a VEGF 5' UTR in an absence 
ofSEQIDNO:4. 

The present invention also includes a nucleic acid molecule comprising a nucleic acid 
sequence encoding a reporter polypeptide, where the nucleic acid sequence encoding a 
reporter polypeptide is operably linked downstream of a UTR containing a NeRP, and the 
UTR is not operably upstream of SEQ ID NO: 3. 

The present invention also includes a heterogeneous population of nucleic acid 
molecules, where the heterogeneous population comprises a reporter nucleic acid sequence, 
and the nucleic acid sequence encoding a reporter polypeptide is operably linked to a VEGF 
5' UTR in an absence of NeRP 1 (SEQ ID NO: 4). 

The present invention also includes a substantially purified nucleic acid molecule 
comprising between 95% and 99% sequence identity with a nucleic acid molecule of SEQ ID 
NO: 3, a fragment thereof, or a complement of either. 

The present invention also includes a substantially purified nucleic acid molecule 
consisting of SEQ ID NO: 3, a fragment thereof, or a complement of either. 

The present invention also includes a substantially purified nucleic acid molecule 
consisting of a first nucleic acid sequence linked to a heterologous nucleic acid sequence 
encoding a polypeptide, where the first nucleic acid sequence is selected from the group 
consisting of SEQ ID NO: 3, a fragment thereof, and a complement of either. 

The present invention also includes a substantially purified nucleic acid molecule 
comprising between 95% and 99% sequence identity with a nucleic acid molecule of SEQ ID 
NO: 4, a fragment thereof, or a complement of either. 

The present invention also includes a substantially purified nucleic acid molecule of a 
nucleic acid sequence selected from a group consisting of SEQ ID NO: 4, a fragment thereof, 
and a complement of either. 
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The present invention also includes a substantially purified nucleic acid molecule 
consisting of a first nucleic acid sequence linked to a heterologous nucleic acid sequence 
encoding a polypeptide, where the first nucleic acid sequence is selected from the group 
consisting of SEQ ID NO: 4, a fragment thereof, and a complement of either. 

The present invention also includes a method of making a nucleic acid construct to 
screen for a compound comprising: a) providing a main ORF downstream of a promoter in 
the nucleic acid construct; b) operably linking a VEGF 5' UTR in an absence of SEQ ID NO: 
4 upstream of the main ORF; and c) operably linking a VEGF 3' UTR downstream of the 
main ORF. 

The present invention also includes a method of screening in vivo for a compound that 
modulates UTR-dependent expression comprising: a) providing a cell having a nucleic acid 
molecule comprising a promoter upstream from a VEGF 5' UTR in an absence of SEQ ID 
NO: 4, where the VEGF 5' UTR in an absence of SEQ ID NO: 4 is upstream from a nucleic 
acid sequence encoding a reporter polypeptide, and the nucleic acid sequence encoding a 
reporter polypeptide is upstream from a VEGF 3' UTR; b) contacting the cell with a 
compound; c) producing a nucleic acid molecule that contains a nucleic acid sequence 
encoding a reporter polypeptide and does not contain SEQ ID NO: 4; and d) detecting the 
reporter polypeptide. 

The present invention also includes a method of screening in vitro for a compound 
that modulates UTR-affected expression comprising: a) providing an in vitro translation 
system; b) contacting the in vitro translation system with a compound and a nucleic acid 
molecule comprising a VEGF 5' UTR in an absence of SEQ ID NO: 4, where the VEGF 5' 
UTR in an absence of SEQ ID NO: 4 is upstream from a nucleic acid sequence encoding a 
reporter polypeptide and the nucleic acid sequence encoding a reporter polypeptide is 
upstream from a VEGF 3 ' UTR; and c) detecting the reporter polypeptide in vitro. 

The present invention also includes a method of expressing a nucleic acid molecule in 
a cell comprising: a) providing a nucleic acid molecule to a cell, where the nucleic acid 
molecule comprises a nucleic acid sequence encoding a reporter polypeptide flanked by 
VEGF UTRs in an absence of SEQ ID NO: 4; and b) detecting the reporter polypeptide. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a main ORF-independent, UTR-affected mechanism 
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comprising: a) growing a stable cell line having a reporter gene operably linked to a VEGF 
5' UTR in an absence of SEQ ID NO: 4; b) comparing the stable cell line in a presence of a 
compound relative to the stable cell line in an absence of the compound; and c) selecting for 
the compound that modulates protein expression through a main ORF -independent, UTR- 
affected mechanism. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a main ORF-independent, UTR-affected mechanism 
comprising: a) growing a stable cell line having a main ORF operably linked to a VEGF 5' 
UTR in an absence of SEQ ID NO: 4; b) comparing the stable cell line in the presence of a 
compound relative to the stable cell line in the absence of the compound; and c) selecting for 
the compound that modulates protein expression through a main ORF-independent, UTR- 
affected mechanism. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a VEGF-independent, UTR-affected mechanism 
comprising: a) substituting in vivo a VEGF gene with a reporter gene, where a UTR 
consisting of SEQ ID NO: 3 is operably linked to the reporter gene, and the substitution 
occurs in a differentiated cell; b) growing the differentiated cell; and c) selecting for the 
compound that modulates protein expression of the reporter gene through a main ORF- 
independent, UTR-affected mechanism. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a main ORF-independent, UTR-affected mechanism 
comprising: a) substituting in vivo a main ORF with a reporter gene, where a 5' UTR is 
operably linked to the reporter gene and consists of SEQ ID NO: 3, and the substitution 
occurs in a differentiated cell; b) growing the differentiated cell; and c) selecting for the 
compound that modulates protein expression of the reporter gene through a main ORF- 
independent, UTR-affected mechanism. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a UTR-affected mechanism comprising: a) providing a 
stable cell line having a reporter gene operably linked to a VEGF 5' UTR in an absence of 
SEQ ID NO: 4, where the stable cell line mimics post-transcriptional regulation of a VEGF 
gene found in vivo in presence of the compound; b) maintaining the stable cell line; and c) 
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selecting for the compound that modulates protein expression of the reporter gene through a 
UTR-affected mechanism. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a UTR-affected mechanism comprising: a) providing a 
stable cell line having a main ORF encoding a reporter polypeptide operably linked to a 
VEGF 5' UTR in an absence of SEQ ID NO: 4, where the stable cell line mimics post- 
transcriptional regulation of a VEGF gene found in vivo in a presence of a compound; b) 
maintaining the stable cell line; and c) selecting for the compound that modulates protein 
expression of the main ORF through a UTR-affected mechanism. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a UTR-affected mechanism mediating the effect of a 
NeRP comprising: a) growing a stable cell line having a reporter gene operably linked to a 5' 
VEGF UTR in an absence of a NeRP 1 (SEQ ID NO: 4); b) comparing the stable cell line in a 
presence of a compound relative to in an absence of the compound, where the compound 
does not modulate UTR-dependent expression if the 5' VEGF UTR in an absence of a 
NeRPl (SEQ ID NO: 4) is operably linked to a reporter gene; and c) selecting for the 
compound that modulates protein expression of the reporter gene through a UTR-affected 
mechanism mediating the effect of a NeRP. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a UTR-affected mechanism mediating the effect of a 
NeRP comprising: a) growing a stable cell line having a main ORF encoding a reporter 
polypeptide operably linked to a 5' VEGF UTR in an absence of a NeRPl (SEQ ID NO: 4); 
b) comparing the stable cell line in the presence of a compound relative to that in the absence 
of the compound, where the compound does not modulate UTR-dependent expression if the 
5' VEGF UTR in an absence of a NeRPl (SEQ ID NO: 4) is operably linked to a main ORF; 
and c) selecting for the compound that modulates protein expression of the main ORF 
through a UTR-affected mechanism mediating the effect of a NeRP . 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a UTR-affected mechanism mediating the effect of a 
NeRP comprising: a) growing a stable cell line having a reporter gene operably linked to a 
UTR having a NeRPl (SEQ ID NO: 4); b) comparing the stable cell line in a presence of a 
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compound relative to in an absence of the compound, where the compound modulates UTR- 
dependent expression if a NeRPl (SEQ ID NO: 4) is operably linked to a reporter gene; and 
c) selecting for the compound that modulates protein expression of the reporter gene through 
a UTR-affected mechanism mediating the effect of a NeRP. 

The present invention also includes a method of screening for a compound that 
modulates protein expression through a UTR-affected mechanism mediating the effect of a 
NeRP comprising: a) growing a stable cell line having a main ORF encoding a reporter 
polypeptide operably linked to a UTR having a NeRPl (SEQ ID NO: 4); b) comparing the 
stable cell line in a presence of a compound relative to in an absence of the compound, where 
the compound modulates UTR-dependent expression if a NeRPl (SEQ ID NO: 4) is operably 
linked to a main ORF encoding a reporter polypeptide; and c) selecting for the compound that 
modulates protein expression of the main ORF encoding a reporter polypeptide through a 
UTR-affected mechanism mediating the effect of the NeRP. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 sets forth a sequence alignment of VEGF 5' UTR for mouse, rat, and human. 
Figure 2 sets forth an example of a compound that inhibits VEGF expression over a 
concentration range. 

Figure 3 sets forth an example of UTR-dependent inhibition of VEGF expression, 
where VEGF expression is dependent on one or more VEGF UTRs. 

Figure 4A sets forth an example of effective inhibition of VEGF production in tumor 

tissue. 

Figure 4B sets forth the inhibition of human VEGF production in vivo by a 
compound. 

Figure 5 sets forth a schematic of a 5' VEGF UTR and restriction sites therein. VEGF 
5 'UTR is amplified from human genomic DNA by two separate PGR reactions. In the 
overlap region of 5'UTR1 and 5'UTR2, a unique enzyme site BamHIis used to assemble the 
full-length 5 'UTR in subsequent cloning. 

Figure 6 sets forth a schematic representation of dicistronic plasmids that can be used 
for transfection experiments. P21uc/5UTR-ANeRPl is a dicistronic plasmid containing 
VEGF 5'UTR1, in which nucleotides 337 to 1083 of the VEGF cDNA are fused to the firefly 
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luciferase coding sequence; P21uc/5UTR-FL is generated by subcloning VEGF 5'UTR2 into 
the plasmid p21uc/vegf5utrl between Sal /and BamHI; plasmid p21uc/5UTR-A5 1-476 is 
derived from p21uc/5UTR-FL by removing the Nhe / fragment (nt 51 to 746); plasmid 
p21uc/5UTR-A476-1038 is derived from p21uc/vegf5utr-fl by removing the sequence from 
BamHI site to the 3'end of 5'UTR; plasmid p21uc/5UTR-A 1-476 is derived from 
p21uc/5UTR-FL by removing the sequence from BamHI to the 5 'end of 5'UTR. P21uc-e can 
be used as a negative control in this study. 

Figure 7 sets forth results identifying the presence of a VEGF IRES domain (a 
PTCRE) and a NeRP in the VEGF mRNA 5'UTR. Reporter gene expression is analyzed by 
monitoring luciferase activity. 

Description of the Nucleic Acid Sequences 
SEQ ID NO: 1 sets forth a full-length VEGF 5' UTR. 
SEQ ID NO: 2 sets forth an open reading frame encoding VEGF. 
SEQ ID NO: 3 sets forth a PTCRE 1, a 702 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 4 sets forth a NeRPl, a 336 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 5 sets forth a PTCRE2, a 485 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 6 sets forth a PTCRE3, a 556 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 7 sets forth a PTCRE4, a 294 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 8 sets forth a PTCRE5, a 194 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 9 sets forth a NeRP2, a 476 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 10 sets forth a NeRP3, a 554 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 1 1 sets forth a NeRP4, a 51 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 12 sets forth aNeRP5, a 91 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 13 sets forth a NeRP6, a 335 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 14 sets forth aNeRP7, a 332 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 15 sets forth a NeRP8, a 331 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 16 sets forth aNeRP9 : a 330 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 17 sets forth a NeRPIO, a 329 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 1 8 sets forth a NeRPl 1, a 328 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 19 sets forth a NeRP12, a 327 nucleotide region of a VEGF 5' UTR. 
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SEQ ID NO: 20 sets forth a NeRP13, a 326 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 2 1 sets forth a NeRP 1 4, a 3 1 6 nucleotide region of a VEGF 5 ' UTR. 
SEQ ID NO: 22 sets forth a NeRP 15, a 306 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 23 sets forth a NeRP 16, a 296 nucleotide region of a VEGF 5' UTR. 
SEQ ID NO: 24 sets forth a NeRP 1 7, a 286 nucleotide region of a VEGF 5 ' UTR. 
SEQ ID NO: 25 sets forth a NeRP 1 8, a 276 nucleotide region of a VEGF 5 ' UTR. 
SEQ ID NO: 26 sets forth a NeRP 19, a 266 nucleotide region of a VEGF 5' UTR. 

Definitions 

As used herein, the term "construct" refers to an artificially manipulated nucleic acid 
molecule. 

As used herein, the term "heterologous" refers to ingredients or constituents of 
dissimilar or diverse origin. 

As used herein, the term "mammalian cancer cell" or "mammalian tumor cell" refers 
to a cell derived from a mammal that proliferates inappropriately. 

As used herein, the term "main ORF-independent mechanism" refers to a cellular 
pathway or process, wherein at least one step relates to gene expression and is not dependent 
on the nucleic acid sequence of the main open reading frame. 

As used herein, the term "reporter gene" refers to any gene whose expression can be 
measured. 

As used herein, the term "RNA induced gene silencing, or RNA interference (RNAi)" 
refers to the mechanism of double-stranded RNA (dsRNA) introduced into a system to 
reduce protein expression of specific genetic sequence. 

As used herein, the term "specifically bind" means that a compound binds to another 
compound in a manner different from a similar type of compounds, e.g. in terms of affinity, 
avidity, and the like. In a non-limiting example, more binding occurs in the presence of a 
competing reagent, such as casein. In another non-limiting example, antibodies that 
specifically bind a target protein should provide a detection signal at least 2-, 5-, 10-, or 20- 
fold higher relative to a detection signal provided with other molecules when used in Western 
blots or other immunochemical assays. In an alternative non-limiting example, a nucleic acid 
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can specifically bind its complementary nucleic acid molecule. In another non-limiting 
example, a transcription factor can specifically bind a particular nucleic acid sequence. 

As used herein, the term "secondary structure" means the alpha-helical, beta-sheet, 
random coil, beta turn structures and helical nucleic acid structures that occur in proteins, 
polypeptides, nucleic acids, compounds comprising modified nucleic acids, compounds 
comprising modified amino acids, and other types of compounds as a result of, at least, the 
compound's composition. 

As used herein, the term "non-peptide therapeutic agent" and analogous terms 
include, but are not limited to organic or inorganic compounds (i.e., including heteroorganic 
and organometallic compounds but excluding proteins, polypeptides and nucleic acids). 

As used herein, the term "UTR" refers to the untranslated region of a mRNA. 

As used herein, the tenn "untranslated region-dependent expression" or "UTR- 
dependent expression" refers to the regulation of gene expression through UTRs at the level 
of mRNA expression, i.e., after transcription of the gene has begun until the protein or the 
RNA product(s) encoded by the gene has degraded or excreted. 

As used herein, the term "vector" refers to a nucleic acid molecule used to introduce a 
nucleic acid sequence in a cell or organism. The entirety of the International Application 
PCT/US04/01643, filed January 21, 2004, including the sequence listing, is hereby 
incorporated by reference. 

DETAILED DESCRIPTION OF THE INVENTION 
The present invention includes and utilizes the fact that an untranslated region (UTR) 
is capable of modulating expression of a gene and that such modulation of expression is 
capable of being altered or modulated by the addition of compounds. In a preferred 
embodiment, a UTR is a region of a RNA that is not translated into protein. In a more 
preferred embodiment, a UTR is a flanking region of the RNA transcript that is not translated 
into the targeted protein, and can include a 5' UTR that has a short, putative open reading 
frame. In a most preferred embodiment, the UTR is a 5' UTR, i.e., upstream of the coding 
region, or a 3' UTR, i.e., downstream of the coding region. 

Moreover, the present invention includes and provides agents and methods useful in 
screening for a compound capable of modulating gene expression and also hybrid molecules. 
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Nucleic Acid Agents and Constructs 

One skilled in the art may refer to general reference texts for detailed descriptions of 
known techniques discussed herein or equivalent techniques. These texts include Ausubel et 
al, Current Protocols in Molecular Biology, John Wiley and Sons, Inc. (1995); Sambrook et 
al, Molecular Cloning, A Laboratory Manual (2d ed.), Cold Spring Harbor Press, Cold 
Spring Harbor, New York (1989); Birren et al, Genome Analysis: A Laboratory Manual, 
volumes 1 through 4, Cold Spring Harbor Press, Cold Spring Harbor, New York (1997- 
1999). These texts can, of course, also be referred to in making or using an aspect of the 
invention. 

UTRs 

The present invention includes nucleic acid molecules with UTRs that comprise or 
consist of a post-transcriptional regulatory element (PTCRE) including SEQ ID NO: 3, a 
negative regulator of a PTCRE (NeRP) including SEQ ID NO: 4, and fragments and 
complements of all. 

A PTCRE of the present invention can differ from any of the residues in SEQ ID NO: 
3 in that the nucleic acid sequence has been deleted, substituted, or added in a manner that 
does not alter the function. In another aspect of the present invention, a PTCRE of the 
present invention consists or comprises SEQ ID NO: 5, and fragments and complements of 
all. In another aspect of the present invention, a PTCRE of the present invention consists or 
comprises SEQ ID NOs: 6-8, and fragments and complements of all. 

A PTCRE of the present invention can differ from any of the residues in an 
untranslated region selected from the group consisting of a nucleic acid sequence consisting 
or comprising SEQ ID NO: 3 and SEQ ID NOs: 5-8 in that the nucleic acid sequence has 
been deleted, substituted, or added in a manner that does not alter the function. 

A NeRP of the present invention can differ from any of the residues in SEQ ID NO: 4 
in that the nucleic acid sequence has been deleted, substituted, or added in a manner that does 
not alter the function. 

In another aspect of the present invention, a NeRP of the present invention consists or 
comprises SEQ ID NO: 9, and fragments and complements of all. In another aspect of the 
present invention, a NeRP of the present invention consists or comprises SEQ ID NOs: 10- 
12, and fragments and complements of all. 
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A NeRP of the present invention can differ from any of the residues in a UTR selected 
from the group consisting of a nucleic acid sequence consisting or comprising SEQ ID NO: 4 
and SEQ ID NOs: 9-12 in that the nucleic acid sequence has been deleted, substituted, or 
added in a manner that does not alter the function. In one aspect, a NeRP is not a full-length 
sequence of a target UTR. In a preferred aspect, a NeRP is not a full-length VEGF 5' UTR. 

In one embodiment, a NeRP can be a nucleic acid sequence with a single base 
deletion at any location of SEQ ID NO: 4. Therefore, a NeRP of the invention may be 335 
bases. In another embodiment, a NeRP includes a nucleic acid sequence with two or more 
bases deleted from any location of SEQ ID NO: 4. In another embodiment, a NeRP includes 
3, 4, 5, 6, 7, 8, 9 or 10 residue deletions at any location of SEQ ID NO: 4. In another 
embodiment, a NeRP includes the remaining nucleic acid sequence resulting from the 
deletion of 20, 30, 40, 50, 60, or 70 residues from any location of SEQ ID NO: 4. In light of 
the specification, a NeRP of the present invention may be produced by contiguous, 
noncontiguous, or a combination of contiguous deletions and noncontiguous deletions of 
SEQ ID NO: 4 in a manner that does not alter the function of the NeRP. 

In a preferred embodiment, when a nucleic acid molecule, which includes a NeRP and 
a PTCRE, has been deleted, substituted, or added to in a manner that removes the negative 
regulation of a PTCRE, the secondary structure of the remaining nucleic acid molecule is 
altered in a manner comparable to the alteration in the secondary structure of a full-length 
VEGF 5' UTR when SEQ ID NO: 4 is deleted. In a more preferred embodiment, a NeRP of 
the present invention has a secondary structure comparable to the secondary structure of SEQ 
ID NO: 4. 

In an embodiment of the present invention, the presence of a NeRP can be detected by 
the deletion, insertion or alteration of one or more pseudoknots from a larger nucleic acid 
molecule. In another embodiment, the deletion, insertion or alteration of a stem-loop 
structure from a larger nucleic acid molecule of the present invention results in a NeRP of the 
present invention. Programs such as mfold, {see the world wide web at 
bioweb.pasteur.fr/seqanal/interfaces/mfold-simple.html) genebee {see the world wide web at 
genebee.msu.su/) may be used to ascertain the secondary structure of SEQ ID NO: 4 and 
other nucleic acid molecules of the present invention. Other programs or methods well 
known to those of skill in the art can also be employed. 



12 



WO 2005/118857 



PCT/US2004/020751 



The present invention also includes a NeRP that has a tertiary structure comparable to 
the tertiary structure formed by SEQ ID NO: 4. Tertiary structure can be determined by, for 
example, crystallography and phylogenetic covariation (as reviewed in Martin I, et al, 
Biochim BiophysActa. 2003 Jul 11;1614(1):97-103; Heinemann U., et al, Biol Chem. 1996 
Jul-Aug;377(7-8):447-54). 

The present invention provides nucleic acid molecules that hybridize to the above- 
described nucleic acid molecules. In a preferred aspect, the nucleic acid molecule hybridizes 
to a nucleic acid molecule selected from the group consisting of a nucleic acid sequence 
consisting or comprising SEQ ID NOs: 3-12, and complements thereof. Nucleic acid 
hybridization is a technique well known to those of skill in the art of DNA manipulation. The 
hybridization properties of a nucleic acid molecule are an indication of their similarity or 
identity. The nucleic acid molecules preferably hybridize, under moderate or high stringency 
conditions, with a nucleic acid sequence selected from SEQ ID NO: 5 and complements 
thereof. Fragments of these sequences are also contemplated. 

In another aspect, the nucleic acid molecules preferably hybridize, under moderate or 
high stringency conditions, with a nucleic acid sequence selected from the group consisting 
of SEQ ID NO: 5 and its complement. 

The hybridization conditions typically involve nucleic acid hybridization in about 
0.1X to about 10X SSC (diluted from a 20X SSC stock solution containing 3 M sodium 
chloride and 0.3M sodium citrate, pH 7.0 in distilled water), about 2.5X to about 5X 
Denhardt's solution (diluted from a 50X stock solution containing 1% (w/v) bovine serum 
albumin, 1% (w/v) Ficoll® (Amersham Biosciences Inc., Piscataway, NJ), and 1% (w/v) 
polyvinylpyrrolidone in distilled water), about 10 mg/ml to about 100 mg/ml salmon sperm 
DNA, and about 0.02% (w/v) to about 0.1% (w/v) SDS, with an incubation at about 20° C to 
about 70° C for several hours to overnight. 

In a preferred aspect, the moderate stringency hybridization conditions are provided 
by 6X SSC, 5X Denhardt's solution, 100 mg/ml salmon sperm DNA, and 0.1% (w/v) SDS, 
with an incubation at 55° C for several hours. The moderate stringency wash conditions are 
about 0.02% (w/v) SDS, with an incubation at about 55° C overnight. In a more preferred 
aspect, the high stringency hybridization conditions are about 2X SSC, about 3X Denhardt's 
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solution, and about 10 mg/ml salmon sperm DNA. The high stringency wash conditions are 
about 0.05% (w/v) SDS, with an incubation at about 65° C overnight. 

In an embodiment, the nucleic acid molecule comprises a nucleic acid sequence that is 
greater than 85% identical, and more preferably greater than 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, or 99% identical to a nucleic acid sequence selected from the group 
consisting of SEQ ID NOs: 3-12, complements thereof, and fragments of any of these 
sequences. 

The percent identity is preferably determined using the "Best Fit" or "Gap" program 
of the Sequence Analysis Software Package™ (Version 10; Genetics Computer Group, Inc., 
University of Wisconsin Biotechnology Center, Madison, WI). "Gap" utilizes the algorithm 
of Needleman and Wunsch to find the alignment of two sequences that maximizes the 
number of matches and minimizes the number of gaps. "BestFit" performs an optimal 
alignment of the best segment of similarity between two sequences and inserts gaps to 
maximize the number of matches using the local homology algorithm of Smith and 
Waterman. The percent identity calculations may also be performed using the Megalign 
program of the LASERGENE bioinfonnatics computing suite (default parameters, 
DNASTAR Inc., Madison, Wisconsin). The percent identity is most preferably determined 
using the "Best Fit" program using default parameters. 

Fragment nucleic acid molecules can contain significant portions of, or indeed most 
of, SEQ ID NO: 3. In an embodiment, the fragments are between about 160 and 250 
consecutive residues, about 260 and about 350 consecutive residues, about 360 and about 400 
consecutive residues, or about 460 and 500 consecutive residues long of a nucleic molecule 
of the present invention. In another embodiment, the fragment comprises at least 170, 300, 
500, or 600 consecutive residues of SEQ ID NO: 3. In a particularly preferred embodiment, a 
fragment nucleic acid molecule is capable of selectively hybridizing to SEQ ID NO: 3. 

In one embodiment, a PTCRE comprises or consists of SEQ ID NO: 3. In another 
embodiment, a PTCRE comprises or consists of a fragment of SEQ ID NO: 3. In a preferred 
embodiment, a PTCRE can share identity with between particular mammals, including, but 
not limited to human, mouse, and rat. In another preferred embodiment, a PTCRE is unique 
to a human VEGF 5' UTR PTCRE, a non-limiting example of which is SEQ ID NO: 3. 
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Fragment nucleic acid molecules can contain significant portions of SEQ ID NO: 4. 
In another embodiment of the present invention, nucleic acid molecules can comprise or 
consist of significant portions of SEQ ID NO: 4. In an embodiment, the fragments are 
between about 40 and about 90 consecutive residues, about 100 and about 150 consecutive 
residues, about 160 and about 250 consecutive residues, or about 260 and 325 consecutive 
residues long of a nucleic molecule of the present invention. In another embodiment, the 
fragment comprises at least 90, 150, 250, or 325 consecutive residues of SEQ ID NO: 4. In a 
preferred embodiment, a fragment nucleic acid molecule is capable of selectively hybridizing 
to SEQ ID NO: 4. 

In one embodiment, a NeRP comprises or consists of SEQ ID NO: 4. In another 
embodiment, a NeRP comprises or consists of a fragment of SEQ ID NO: 4. In a preferred 
embodiment, a NeRP can share identity with between particular mammals, including, but not 
limited to human, mouse, and rat. In another preferred embodiment, a NeRP is unique to a 
human VEGF 5' UTR NeRP, a non-limiting example of which is SEQ ID NO: 4. 

Any of a variety of methods may be used to obtain one or more of the above- 
described nucleic acid molecules of the present invention. Automated nucleic acid 
synthesizers may be employed for this purpose. In lieu of such synthesis, the disclosed 
nucleic acid molecules may be used to define a pair of primers that can be used with the 
polymerase chain reaction (PCR) to amplify and obtain any desired nucleic acid molecule or 
fragment. 

Short nucleic acid sequences having the ability to specifically hybridize to 
complementary nucleic acid sequences may be produced and utilized in the present invention, 
e.g., as probes to identify the presence of a complementary nucleic acid sequence in a given 
sample. Alternatively, the short nucleic acid sequences may be used as oligonucleotide 
primers to amplify or mutate a complementary nucleic acid sequence using PCR technology. 
These primers may also facilitate the amplification of related complementary nucleic acid 
sequences (e.g., related sequences from other species). 

Use of these probes or primers may greatly facilitate the identification of transgenic 
cells or organisms that contain the presently disclosed structural nucleic acid sequences. 
Such probes or primers may also, for example, be used to screen cDNA, mRNA, or genomic 
libraries for additional nucleic acid sequences related to or sharing homology with the 
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presently disclosed promoters and structural nucleic acid sequences. The probes may also be 
PCR probes, which are nucleic acid molecules capable of initiating a polymerase activity 
while in a double-stranded structure with another nucleic acid. 

A primer or probe is generally complementary to a portion of a nucleic acid sequence 
that is to be identified, amplified, or mutated and of sufficient length to form a stable and 
sequence-specific duplex molecule with its complement. The primer or probe preferably is 
about 10 to about 200 residues long, more preferably is about 10 to about 100 residues long, 
even more preferably is about 10 to about 50 residues long, and most preferably is about 14 
to about 30 residues long. 

The primer or probe may, for example without limitation, be prepared by direct 
chemical synthesis, by PCR (U.S. Patent Nos. 4,683,195 and 4,683,202), or by excising the 
nucleic acid specific fragment from a larger nucleic acid molecule. Various methods for 
determining the sequence of PCR probes and PCR techniques exist in the art. Computer- 
generated searches using programs such as Primer3 (www-genome.wi.mit. edu/cgi- 
bin/primer/primer3.cgi), STSPipeline (www-genome.wi.mit.edu/cgi-bin/www- 
STS_Pipeline), or GeneUp (Pesole et at, BioTechniques 25:1 12-123, 1998), for example, can 
be used to identify potential PCR primers. 

Furthermore, sequence comparisons can be done to find nucleic acid molecules of the 
present invention based on secondary structure homology. Several methods and programs 
are available to predict and compare secondary structures of nucleic acid molecules, for 
example, GeneBee (available on the world wide web at 

genebee.msu.su/services/rna2_reduced.html); the Vienna RNA Package (available on the 
world wide web at tbi.univie.ac.at/~ivo/RNA/); SstructView (available on the world wide 
web at the Stanford Medical Informatics website, under: 

projects/helix/sstructview/home.html and described in "RNA Secondary Structure as a 
Reusable Interface to Biological Information Resources." 1997. Gene vol. 190GC59-70). For 
example, comparisons of secondary structure are preformed in Le et al. , A common RNA 
structural motif involved in the internal initiation of translation of cellular mRNAs. 1997. 
Nuc. Acid. Res. vol. 25(2):362-369. 

Constructs of the Present Invention 
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The present invention includes and provides nucleic acid constructs. It is understood 
that any of the constructs and other nucleic acid agents of the present invention can be either 
DNA or RNA. In a preferred embodiment, a construct can be a nucleic acid molecule having 
a UTR, a coding sequence, or both. In another embodiment, a construct is composed of at 
least one UTR of the present invention, a sequence encoding a reporter polypeptide, and a 
vector. Moreover, any of the nucleic acid molecules of the present invention can be used in 
combination with a method of the present invention. 

Vectors 

Exogenous genetic material may be introduced into a host cell by use of a vector or 
construct designed for such purpose. Any of the nucleic acid sequences of the present 
invention can be incorporated into a vector or construct of the present invention. A vector or 
construct of the present invention includes, without limitation, linear or closed circular 
plasmids. A vector system may be a single vector or plasmid or two or more vectors or 
plasmids that together contain the total DNA to be introduced into the genome of the host. In 
a preferred embodiment, a vector contains a promoter functional in mammalian cells or 
bacteria or both. Means for preparing vectors or constructs are well known in the art. 

Vectors suitable for replication in mammalian cells may include viral replicons, or 
sequences that insure integration of the appropriate sequences encoding HCV epitopes into 
the host genome. For example, another vector used to express foreign DNA is vaccinia virus. 
Such heterologous DNA is generally inserted into a gene that is non-essential to the virus, for 
example, the thymidine kinase gene (tk), which also provides a selectable marker. 
Expression of the HCV polypeptide then occurs in cells or animals that are infected with the 
live recombinant vaccinia virus. 

In general, plasmid vectors containing replicon and control sequences that are derived 
from species compatible with the host cell are used in connection with bacterial hosts. The 
vector ordinarily carries a replication site, as well as marking sequences that are capable of 
providing phenotypic selection in transformed cells. For example, E. coli is typically 
transformed using a construct with a backbone derived from a vector, such as pBR322, which 
contains genes for ampicillin and tetracycline resistance and thus provides easy means for 
identifying transformed cells. The pBR322 plasmid, or other microbial plasmid or phage, 
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also generally contains, or is modified to contain, promoters that can be used by the microbial 
organism for expression of the selectable marker genes. 

In a preferred embodiment of the present invention, an expression vector can be a 
high-level mammalian expression vector designed to randomly integrate into the genome, for 
example, pCMRl . In another preferred embodiment of the present invention, an expression 
vector can be a high-level mammalian expression vector designed to site-specifically 
integrate into the genome of cells. For example, pMCPl can site-specifically integrate into 
the genome of cells genetically engineered to contain the FRT site-specific recombination site 
via the Flp recombinase (see, e.g., Craig, 1988, Ann. Rev. Genet. 22: 77-105; and Sauer, 
1994, Curr. Opin. Biotechnol. 5: 521-527). 

Promoters 

A construct can include a promoter, e.g., a recombinant vector typically comprises, in 
a 5' to 3' orientation: a promoter to direct the transcription of a nucleic acid molecule of 
interest. 

In a preferred aspect of the present invention, a construct can include a mammalian 
promoter and can be used to express a nucleic acid molecule of choice. As used herein, a 
"mammalian promoter" refers to a promoter functional in a mammalian cell derived from a 
mammalian cell or both. A number of promoters that are active in mammalian cells have 
been described in the literature. A promoter can be selected on the basis of the cell type into 
which the vector will be inserted. 

A preferred promoter of the present invention is a VEGF promoter. In addition to 
VEGF promoters described previously, other promoter sequences can be utilized in a 
construct or other nucleic acid molecule. Suitable promoters include, but are not limited to, 
those described herein. 

Suitable promoters for mammalian cells are known in the art and include viral 
promoters, such as those from Simian Virus 40 (SV40), Rous sarcoma virus (RSV), 
adenovirus (ADV), cytomegalovirus (CMV), and bovine papilloma virus (BPV), and the 
parvovirus B19p6 promoter as well as mammalian cell-derived promoters. A number of 
viral-based expression systems can be used to express a reporter gene in mammalian host 
cells. For example, if an adenovirus is used as an expression vector, sequences encoding a 
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reporter gene can be ligated into an adenovirus transcription/translation complex comprising 
the late promoter and tripartite leader sequence. 

Other examples of preferred promoters include tissue-specific promoters and 
inducible promoters. Other preferred promoters include the hematopoietic stem cell-specific, 
e.g., CD34, glucose-6-phosphotase, interleukin-1 alpha, CD 11c integrin gene, GM-CSF, 
interleukin-5R alpha, interleukin-2, c-fos, h-ras and DMD gene promoters. Other promoters 
include the herpes thymidine kinase promoter, and the regulatory sequences of the 
metallothionein gene. 

Inducible promoters suitable for use with bacteria hosts include the P-lactamase and 
lactose promoter systems, the arabinose promoter system, alkaline phosphatase, a tryptophan 
(trp) promoter system and hybrid promoters such as the tac promoter. However, other known 
bacterial inducible promoters are suitable. Promoters for use in bacterial systems also 
generally contain a Shine-Dalgarno sequence operably linked to the DNA encoding the 
polypeptide of interest. 

A promoter can also be selected on the basis of their regulatory features, e.g., 
enhancement of transcriptional activity, inducibility, tissue specificity, and developmental 
stage-specificity. A promoter can work in vitro, for example the T7-promoter. Particularly 
preferred promoters can also be used to express a nucleic acid molecule of the present 
invention in a nonhuman mammal. Additional promoters that may be utilized are described, 
for example, in Bernoist and Chambon, Nature 290:304-310 (1981); Yamamoto et al., Cell 
22:787-797 (1980); Wagner et al, PNAS 78:1441-1445 (1981); Brinster et al., Nature 
296:39-42(1982). 

Reporter genes 

As used herein, a "reporter gene" is any gene whose expression can be measured. In a 
preferred embodiment, a reporter gene does not have any UTRs. In a more preferred 
embodiment, a reporter gene is a contiguous open reading frame. In another preferred 
embodiment, a reporter gene can have a previously determined reference range of detectable 
expression. 

Constructs of the invention can comprise one or more reporter genes fused to one or 
more UTRs. For example, specific RNA sequences, RNA structural motifs, and/or RNA 
structural elements that are known or suspected to modulate UTR-dependent expression of a 
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target gene can be fused to the reporter gene. A reporter gene of the present invention 
encoding a protein, a fragment thereof, or a polypeptide, can also be linked to a propeptide 
encoding region. A propeptide is an amino acid sequence found at the amino terminus of a 
proprotein or proenzyme. Cleavage of the propeptide from the proprotein yields a mature 
biochemically active protein. The resulting polypeptide is known as a propolypeptide or 
proenzyme (a zymogen in some cases). Propolypeptides are generally inactive and can be 
converted to mature active polypeptides by catalytic or autocatalytic cleavage of the 
propeptide from the propolypeptide or proenzyme. 

A reporter gene can express a selectable or screenable marker. Selectable markers 
may also be used to select for organisms or cells that contain exogenous genetic material. 
Examples of such include, but are not limited to: a neo gene, which codes for kanamycin 
resistance and can be selected for using kanamycin, GUS, neomycin phosphotransferase II 
(nptIT), or an antibiotic resistance coding sequence. Screenable markers can be used to 
monitor expression. Exemplary screenable markers include: green fluorescent protein (GFP), 
luciferase (LUX), a P-glucuronidase or itidA gene (GUS) which encodes an enzyme for 
which various chromogenic substrates are known; a P-lactamase gene, a gene which encodes 
an enzyme for which various chromogenic substrates are known (e.g., PAD AC, a 
chromogenic cephalosporin); a luciferase gene; a tyrosinase gene, which encodes an enzyme 
capable of oxidizing tyrosine to DOPA and dopaquinone which in turn condenses to melanin; 
and a-galactosidase, which can turn a chromogenic a-galactose substrate. 

Included within the terms "selectable or screenable marker genes" are also genes that 
encode a secretable marker whose secretion can be detected as a means of identifying or 
selecting for transformed cells. Examples include markers that encode a secretable antigen 
that can be identified by antibody interaction, or even secretable enzymes, which can be 
detected utilizing their inherent properties. Secretable proteins fall into a number of classes, 
including small, diffusible proteins which are detectable, (e.g., by ELISA), or small active 
enzymes which are detectable in extracellular solution (e.g., a-amylase, p-lactamase, 
phosphinothricin transferase). Other possible selectable or screenable marker genes, or both, 
are apparent to those of skill in the art. 

A reporter gene can express a fusion protein. As such, the fusion protein can be a 
fusion of any reporter gene operably linked to another gene, or fragment thereof. For 
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instance, the expressed fusion protein can provide a "tagged" epitope to facilitate detection of 
the fusion protein, such as GST, GFP, FLAG, or polyHIS. Such fusions preferably encode 
between 1 and 50 amino acids, more preferably between 5 and 30 additional amino acids, and 
even more preferably between 5 and 20 amino acids. In one embodiment, a fusion protein 
can be a fusion protein that includes in whole or in part of a VEGF protein sequence. 

Alternatively, the fusion can provide regulatory, enzymatic, cell signaling, or 
intercellular transport functions. For example, a sequence encoding a signal peptide can be 
added to direct a fusion protein to a particular organelle within a eukaryotic cell. Such fusion 
partners preferably encode between 1 and 1000 additional amino acids, more preferably 
between 5 and 500 additional amino acids, and even more preferably between 10 and 250 
amino acids. 

The present invention also provides for a reporter gene flanked by one or more 
untranslated regions (e.g., the 5' UTR, 3' UTR, or both the 5' UTR and 3' UTR of the target 
gene). In addition, the present invention provides for a reporter gene flanked by one or more 
UTRs of a target gene, where the UTR contains one or more mutations (e.g., one or more 
substitutions, deletions and/or additions). In a preferred embodiment, the reporter gene is 
flanked by both 5' and 3' UTRs so that compounds that interfere with an interaction between 
the 5' and 3' UTRs can be identified. 

In another preferred embodiment, a stable hairpin secondary structure is inserted into 
the UTR, preferably the 5' UTR of the target gene. For example, in cases where the 5' UTR 
possesses IRES activity, the addition of a stable hairpin secondary structure in the 5' UTR can 
be used to separate cap-dependent from cap-independent translation (see, e.g., Muhlrad et al., 
1995, Mol. Cell. Biol. 15(4):2145-56, the disclosure of which is incorporated by reference in 
its entirety). In another embodiment, an intron is inserted into a UTR (preferably, the 5' 
UTR) or at the 5' end of an ORF of a target gene. For example, but not by limitation, in cases 
where an RNA possesses instability elements, an intron, e.g., the human elongation factor one 
alpha (EF-1 alpha) first intron, can be cloned into a UTR (preferably, the 5' UTR) or a 5' end 
of the ORF to increase expression (see, e.g., Kim et al., 2002, J Biotechnol 93(2):183-7, the 
disclosure of which is incorporated by reference in its entirety). In a preferred embodiment, 
both a stable hairpin secondary structure and an intron are added to the reporter gene 
construct. In a more preferred embodiment, the stable hairpin secondary structure is cloned 
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into the 5' UTR and the intron is added at the 5' end of the main ORF of the reporter gene. 

The reporter gene can be positioned such that the translation of that reporter gene is 
dependent upon the mode of translation initiation, such as, but not limited to, cap-dependent 
translation or cap-independent translation (i.e., translation via an internal ribosome entry 
site). Alternatively, where the UTR contains an upstream open reading frame, the reporter 
gene can be positioned such that the reporter protein is translated only in the presence of a 
compound that shifts the reading frame of the UTR so that the formerly untranslated open 
reading frame is then translated. 

The reporter gene constructs can be monocistronic or multicistronic. A multicistronic 
reporter gene construct may encode 2, 3, 4, 5, 6, 7, 8, 9, 10 or more, or in the range of 2-5, 5- 
10 or 10-20 reporter genes. For example, a dicistronic reporter gene construct comprising, in 
the following order going downstream, a promoter, a first reporter gene, a 5' UTR of a target 
gene, a second reporter gene and optionally, a 3* UTR of a target gene. In such a reporter 
construct, the transcription of both reporter genes is capable of being driven by the promoter. 
In this example construct, the present invention includes the translation of the mRNA from 
the first reporter gene by a cap-dependent scanning mechanism and the translation of the 
mRNA from the second reporter gene by a cap-independent mechanism, for example by an 
IRES. In such a case, the IRES-dependent translation of a mRNA of the second reporter gene 
can be normalized against the cap-dependent translation. 

Reporter genes can be expressed in vitro or in vivo. In vivo expression can be in a 
suitable bacterial or eukaryotic host. Suitable methods for expression are described by 
Sambrook et al, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); Haymes et al, Nucleic Acid 
Hybridization, A Practical Approach, IRL Press, Washington, DC (1985); or similar texts. 
Fusion protein or peptide molecules of the invention are preferably produced via recombinant 
means. These proteins and peptide molecules can be derivatized to contain carbohydrate or 
other moieties (such as keyhole limpet hemocyanin, etc.). 
Linked 

As used herein, linked means physically linked, operably linked, flanked, or any of 
these in combination. 

As used herein, physically linked means that the physically linked nucleic acid 
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sequences are located on the same nucleic acid molecule, for example a promoter can be 
physically linked to a reporter gene as part of a construct. A physical linkage can be 
proximal, and either direct or indirect. In a preferred embodiment, the promoter is operably 
linked and physically linked to a nucleic acid sequence of the present invention. 

A preferred embodiment of the present invention also provides for specific nucleic 
acid molecules containing a reporter gene flanked by one or more UTRs of a target gene. In 
this preferred embodiment, the one or more UTRs of a target gene can be physically linked, 
operably linked, or operably and physically linked to the reporter gene. The present 
invention also provides for a reporter gene flanked by one or more UTRs of a target gene, 
where one or more of the UTRs contains one or more mutations (e.g., one or more of each 
substitution, deletion, addition, or any combination of each). In a more preferred 
embodiment of the present invention, a reporter gene is flanked by a 5' UTR of a VEGF gene 
containing one or more deletions. In a most preferred embodiment, a reporter gene is flanked 
by and operably linked to a 5' VEGF UTR in the absence of aNeRP. 

In a preferred embodiment, the reporter gene is flanked by both 5' and 3' UTRs of 
one or more target genes so that compounds that interfere with an interaction between the 5' 
and 3' UTRs can be identified. In a more preferred embodiment, the reporter gene is flanked 
by a 5' and 3' UTRs of one target gene, and the reporter gene is physically, operably, or 
physically and operably linked to the UTRs of one target gene. In a most preferred 
embodiment, a reporter gene is proximally linked, either directly or indirectly, to one or more 
UTRs of a target gene. If the reporter gene is directly linked to a UTR of a target gene, the 
last nucleic acid residue of the reporter gene is chemically bonded to the first nucleic acid 
residue of the UTR of a target gene. If the reporter gene is proximally linked indirectly to a 
UTR of a target gene, the last nucleic acid residue of the reporter gene is not chemically 
bonded to the first nucleic acid residue of the UTR of a target gene and the last nucleic acid 
residue of the reporter gene can be about 3 or greater than 5 but less than 20. If the reporter 
gene is directly linked to a UTR of a target gene at any time during reporter gene processing, 
such as after a splicing event, the reporter gene is directly linked to the UTR. 
UTRs 

Agents and constructs of the invention include nucleic acid molecules with an 
untranslated region (UTR). In a preferred aspect, a UTR refers to a UTR of an mRNA, i. e. 
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the region of the mRNA that is not translated into protein. In a preferred embodiment, a UTR 
contains one or more regulatory elements that modulate UTR-dependent regulation of gene 
expression. In a particularly preferred embodiment, a UTR is a 5' UTR, i.e., upstream of the 
coding region, or a 3' UTR, i.e., downstream of the coding region. In another particularly 
preferred embodiment, the 5' UTR includes a VEGF promoter. In a more preferred 
embodiment, the 5' UTR includes a VEGF promoter and a PTCRE. 

As used herein, a "main ORF" is a nucleic acid sequence, including sequence 
contained in deoxyribonucleic acid and ribonucleic acid molecules, having an open reading 
frame that can be translated. Examples of a main ORF include a reporter gene, a target gene, 
and a control gene. As used herein, a "target gene" can be any gene. In a preferred 
embodiment, a target gene is a gene operatively linked downstream of a VEGF 5' UTR 
containing an upstream open reading frame ("uORF"). In another embodiment, a target gene 
can be a VEGF main ORF. In a preferred embodiment, a target gene is a gene containing a 
uORF. In a particularly preferred embodiment, a target gene is a gene having greater than 
50% identity greater than 200 residues with respect to a VEGF gene. As used herein, a 
"control gene" can be any gene. In a preferred embodiment, a control gene is a gene 
operatively linked downstream of a VEGF 5' UTR that does not contain a NeRP. 

A UTR of the present invention can be operatively, physically, or operatively and 
physically linked to a reporter gene. In a preferred embodiment of the present invention, a 
UTR of the present invention is physically linked to a reporter gene. The physical, operable, 
or physical and operable linkage may be upstream, downstream, or internal to the reporter 
gene. As used herein, operably linked means that the operably linked nucleic acid sequences 
exhibit their deserved function. For example, a promoter can be operably linked to a reporter 
gene. 

In a preferred aspect of the present invention, a UTR of the present invention is a 
VEGF 5' UTR physically linked upstream of a reporter gene. In a particularly preferred 
embodiment, VEGF 5' UTR contains or consists of SEQ ID NO: 3 and is physically linked 
downstream of a reporter gene. In another embodiment, a VEGF 5' UTR does not contain or 
consist of a NeRP and is physically and operatively linked upstream of a reporter gene. In a 
more particularly preferred embodiment, VEGF 5' UTR does not contain or consist of a 
NeRP and does contain or consist of a PTCRE and is physically and operatively linked 
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upstream of a reporter gene. 

In a preferred embodiment of the present invention, a UTR of the present invention is 
physically linked upstream to a reporter gene and another UTR is physically linked 
downstream of the reporter gene. In a particularly preferred embodiment, a UTR of the 
present invention contains or consists of SEQ ID NO: 3 and is physically and operatively 
linked upstream of a reporter gene and a VEGF 3' UTR is physically and operatively linked 
downstream of a reporter gene. 

In a preferred embodiment of the present invention, a UTR of the present invention is 
physically linked to reporter gene containing an intron. In a more preferred embodiment of 
the present invention, a UTR of the present invention containing SEQ ID NO: 3 is physically 
linked to a reporter gene containing an intron. In a preferred embodiment of the present 
invention, a UTR of the present invention is physically linked upstream of a reporter gene 
and contains an intron internal to the UTR. 

In a preferred embodiment of the present invention, a UTR of the present invention is 
physically linked upstream of a reporter gene and a UTR is physically linked downstream of 
the reporter gene. In a more preferred embodiment of the present invention, a VEGF 5' UTR 
of the present invention containing a SEQ ID NO: 3 is physically linked upstream of a 
reporter gene and a VEGF 3' UTR is physically linked downstream of the reporter gene. 

PTCREs and NeRPs 

As referred to herein, a PTCRE is a post-transcriptional regulatory element that 
modulates expression of a target gene. In one aspect, a PTCRE is not a full-length sequence 
of a target UTR. In a preferred aspect, a PTCRE is not a full-length VEGF 5' UTR. In one 
embodiment, a PTCRE in one target gene can have primary nucleic acid sequence similarity 
to a PTCRE in a different target gene. Alternatively, there may not be any primary nucleic 
acid sequence similarity in PTCREs of similar function. In a preferred embodiment, a 
PTCRE in one target gene can have a secondary, tertiary, or secondary and tertiary structure 
similar to a PTCRE in a different target gene. Examples of PTCREs include, but are not 
limited to, IRES elements, upstream ORFs, and AREs. 

In one embodiment, a PTCRE is a nucleic acid sequence in a UTR, which modulates 
UTR-dependent expression. A PTCRE can be a nucleic acid sequence selected from the 
group consisting of an iron response element ("IRE"), Internal ribosome entry site ("IRES"), 
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upstream open reading frame ("uORF"), male specific lethal element ("MSL-2"), G quartet 
element, and 5 '-terminal oligopyrimidine tract ("TOP"), AU-rich element ("ARE"), 
selenocysteine insertion sequence ("SECIS"), histone stem loop, cytoplasmic polyadenylation 
element ("CPE"), nanos translational control element, amyloid precursor protein element 
("APP"), translational regulation element ("TGE")/direct repeat element ("DRE"), Bruno 
element ("BRE"), or a 15 -lipoxygenase differentiation control element ("15-LOX-DICE"). In 
an alternative embodiment, a PTCRE is not an IRES from VEGF. In another embodiment, 
PTCRE is not be an IRES. In a preferred embodiment, a PTCRE is SEQ ID NO: 3. In a 
most preferred embodiment, a PTCRE is not a NeRP and the PTCRE does not contain or 
consist of a NeRP. 

A NeRP is a nucleic acid sequence in a UTR, which modulates PTCRE-dependent 
expression in a NeRP-dependent mechanism. In one embodiment of the present invention, a 
NeRP regulates an IRES. In a preferred embodiment, a NeRP suppresses IRES-dependent 
expression of a gene. In a most preferred embodiment, a NeRP is SEQ ID NO: 4. 
Alternatively, a NeRP can modulate PTCRE-dependent expression, where the PTCRE 
suppresses gene expression, so that the NeRP is capable of increasing gene expression by a 
NeRP-dependent mechanism. In an alternative embodiment of the invention, a NeRP mimics 
an IRES. 

A NeRP of the present invention can differ from any of the residues in SEQ ID NO: 4 
in that the nucleic acid sequence has been deleted, substituted, or added in a manner that does 
not alter the function. In another aspect of the present invention, a NeRP of the present 
invention consists or comprises SEQ ID NO: 9, and fragments and complements of all. In 
another aspect of the present invention, a NeRP of the present invention consists or comprises 
SEQ ID NOs: 10-12, and fragments and complements of all. 

While the present invention is directed, in part, to VEGF 5' UTRs, PTCREs of the 
present invention can be located in any position within a construct and not limited to the 5' 
UTR region of a construct. A PTCRE of the present invention can be operatively, physically, 
or operatively and physically linked to a UTR. In an alternative embodiment of the present 
invention, a PTCRE of the present invention is a UTR of the present invention. 

While the present invention is directed, in part, to a NeRP in the VEGF 5' UTR, 
NeRPs of the present invention can be located in any position within a construct and not 
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limited to the VEGF 5' UTR region of a construct. A NeRP of the present invention can be 
operatively, physically, or operatively and physically linked to a UTR. In an alternative 
embodiment of the present invention, a NeRP of the present invention is upstream of a 
PTCRE of the present invention. 

In a preferred embodiment, a PTCRE of the present invention is located between 
about 1 to about 1 00 residues upstream from the initiation codon of an open reading frame in 
a mRNA, between about 150 to about 250 residues upstream from the initiation codon, or 
between about 300 to about 500 residues upstream from the initiation codon. In a most 
preferred embodiment, the untranslated region is about 1 residue upstream from the initiation 
codon. 

In a preferred embodiment, a NeRP of the present invention is between about 1000 to 
about 500 residues upstream from a PTCRE, between about 500 to about 100 residues 
upstream from a PTCRE, or between about 100 to about 60 residues upstream from a 
PTCRE. In another embodiment, a PTCRE is within about 1000 residues upstream from the 
5' end of a main ORF, about 500 residues upstream from the 5' end of a main ORF, or within 
about 200 residues upstream from the 5' end of a main ORF, or about 100 residues upstream 
from the 5' end of a main ORF. In another embodiment, a PTCRE is within the main ORF 
and between about 1 000 to about 500 residues upstream from the 3' end of a main ORF, 
between about 500 to about 100 residues upstream from the 3' end of a main ORF, or 
between about 1 00 to about 60 residues upstream from the 3 ' end of a main ORF. In a most 
preferred embodiment, the PTCRE is within 30 residues upstream from the 5' end of a main 
ORF. 

Constructs of the present invention can have more or fewer components than 
described above. For example, constructs of the present invention can include genetic 
elements, including but not limited to, 3' transcriptional terminators, 3' polyadenylation 
signals, other untranslated nucleic acid sequences, transit or targeting sequences, selectable or 
screenable markers, promoters, enhancers, and operators, as desired. Constructs of the 
present invention can also contain a promoterless gene that may utilize an endogenous 
promoter upon insertion into a host cell chromosome. 

Alternatively, sequences encoding nucleic acid molecules of the present invention can 
be cloned into a vector for the production of an mRNA probe. Such vectors are known in the 
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art, are commercially available, and can be used to synthesize RNA probes in vitro by 
addition of labeled nucleotides and an appropriate RNA polymerase such as T7, T3, or SP6. 
These procedures can be conducted using a variety of commercially available kits (for 
example, Amersham Biosciences Inc., Piscataway, NJ; and Promega Co, Madison, WT). 

Modulation of Gene Expression by Nucleic Acid Molecules of the Present Invention 

Modulation of gene expression can result in more or less gene expression. Many 
approaches for modulating gene expression using nucleic acid molecules of the present 
invention are known to one skilled in the art. For example, over-expression of a gene product 
can be the result from transfection of a construct of the present invention into a mammalian 
cell. Similarly, down-regulation can be the result from transfection of a construct of the 
present invention into a mammalian cell. Other non-limiting examples include anti-sense 
techniques like RNA interference (RNAi), transgenic animals, hybrids, and ribozymes. The 
following examples are provided by way of illustration, and are not intended to be limiting of 
the present invention. 

Cellular Mechanisms 

As used herein, the term "UTR-dependent expression" refers to the regulation of gene 
expression through UTRs at the level of mRNA expression, i.e., after transcription of the 
gene has begun until the protein or the RNA product(s) encoded by the gene has degraded. In 
a preferred embodiment, the term "UTR-dependent expression" refers to the regulation of 
mRNA stability or translation. In a more preferred embodiment, the term "UTR-dependent 
expression" refers to the regulation of gene expression through regulatory elements present in 
an UTR(s). Altering the sequence of a PTCRE within a UTR of target gene may change the 
amount of UTR-dependent expression observed for that target gene. 

As used herein, a "UTR-affected mechanism" is a cellular mechanism that 
discriminates between UTRs based on their nucleic acid sequence or based on properties that 
are a function of their sequence such as the secondary, tertiary, or quaternary structure or 
other associated factors. Modulation of the UTR-dependent expression of a target gene can 
be due to a change in how a UTR-affected mechanism acts on the target gene. For example, 
a UTR in a target gene can contain an IRES, which affects target gene expression via a UTR- 
affected mechanism. 
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In a preferred embodiment, a UTR-affected mechanism can be a main ORF- 
independent mechanism. As used herein, a "main ORF-independent mechanism" refers to a 
cellular pathway or process, wherein at least one step relates to gene expression and is not 
dependent on the nucleic acid sequence of the main open reading frame. In a preferred 
embodiment, a UTR-affected mechanism is a main ORF-independent, UTR-affected 
mechanism. 

In order to exclude the possibility that a particular compound is functioning solely by 
modulating the expression of a target gene in an UTR-independent manner, one or more 
mutations may be introduced into the UTRs operably linked to a reporter gene and the effect 
on the expression of the reporter gene in a reporter gene-based assay described herein can be 
determined. For example, a reporter gene construct comprising the 5' UTR of a target gene 
may be mutated by deleting a fragment of the 5' UTR of the target gene or substituting a 
fragment of the 5' UTR of the target gene with a fragment of the 5' UTR of another gene and 
measuring the expression of the reporter gene in the presence and absence of a compound 
that has been identified in a screening assays of the present invention or of an assay well 
known to the skilled artisan. If the deletion of a fragment of the 5' UTR of the target gene or 
the substitution of a fragment of the 5' UTR of the target gene with a fragment of the 5' UTR 
of another gene affects the ability of the compound to modulate the expression of the reporter 
gene, then the fragment of the 5' UTR deleted or substituted plays a role in the regulation of 
the reporter gene expression and the regulation, at least in part, in an UTR-dependent manner. 

Alternatively or in conjunction with tests described above, the possibility that a 
particular compound is functioning solely by modulating the expression of a target gene in an 
UTR-independent manner can be determined by changing the vector utilized as a reporter 
construct. The UTRs flanked by a reporter gene from the first reporter construct in which an 
effect on reporter gene expression was detected following exposure to a compound may be 
inserted into a new reporter construct that has, e.g., different transcriptional regulation 
elements (e.g., a different promoter) and a different selectable marker. The level of reporter 
gene expression in the presence of the compound can be compared to the level of reporter 
gene expression in the absence of the compound or in the presence of a control (e.g., PBS). If 
there is no change in the level of expression of the reporter gene in the presence of the 
compound relative to the absence of the compound or in the presence of a control, then the 
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compound probably is functioning in an UTR-independent manner. 

Compounds, identified in assays of the present invention, that are capable of 
modulating UTR-dependent expression of a target gene (for convenience referred to herein as 
a "lead" compound) can be further tested for UTR-dependent binding to the target RNA 
(which contains at least one UTR, and preferably at least one element of an UTR, for 
example a PTCRE). Furthermore, by assessing the effect of a compound on target gene 
expression, cw-acting elements, i.e., specific nucleotide sequences, that are involved in UTR- 
dependent expression may be identified. RNA binding assays, subtraction assays, and 
expressed protein concentration and activity assays are examples of methods to determine 
UTR-dependent expression of a gene. 

Hybrids 

In one aspect of the present invention, a hybrid of a compound and a PTCRE or a 
NeRP of the present invention is a hybrid formed between two non-identical molecules. In a 
preferred aspect, a hybrid can be formed between two nucleic acid molecules. For example, a 
hybrid can be formed between two ribonucleic acid molecules, between a ribonucleic acid 
molecule and a deoxyribonucleic acid molecule, or between derivatives of either. In 
alternative embodiment, a hybrid can be formed between a nucleic acid of the present 
invention and a non-nucleic acid molecule. In a preferred embodiment, a hybrid can be 
fonned between a nucleic acid molecule and a non-nucleic acid molecule, for example, a 
polypeptide or a non-peptide therapeutic agent. 

Ribozymes 

In one aspect of the present invention, the activity or expression of a gene is regulated 
by designing trans-cleaving catalytic RNAs (ribozymes) specifically directed to a nucleic 
acid molecule of the present invention, for example, SEQ ID NO: 3 and SEQ ID NOs: 5-8. 
In an alternate aspect, the activity or expression of a gene is regulated by designing trans- 
cleaving catalytic RNAs (ribozymes) specifically directed to a nucleic acid molecule of the 
present invention, for example, SEQ ID NO: 4 and SEQ ID NOs: 9-12. 

Ribozymes are RNA molecules possessing endoribonuclease activity. Ribozymes are 
specifically designed for a particular target, and the target message contains a specific 
nucleotide sequence. They are engineered to cleave any RNA species site-specifically in the 
background of cellular RNA. The cleavage event renders the mRNA unstable and prevents 
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protein expression. Importantly, ribozymes can be used to inhibit expression of a gene of 
unknown function for the purpose of determining its function in an in vitro or in vivo context, 
by detecting a phenotypic effect. 

One commonly used ribozyme motif is the hammerhead, for which the substrate 
sequence requirements are minimal. Design of the hammerhead ribozyme, and the 
therapeutic uses of ribozymes, are disclosed in Usman et al, Current Opin. Strict. Biol. 
6:527-533 (1996). Ribozymes can also be prepared and used as described in Long et al, 
FASEBJ. 7:25 (1993); Symons, Ann. Rev. Biochem. 61:641 (1992); Perrotta et al, Biochem. 
31:16-17 (1992); Ojwang et al., PNAS 89:10802-10806 (1992); and U.S. Patent No. 
5,254,678. 

Ribozyme cleavage of HIV-I RNA, methods of cleaving RNA using ribozymes, 
methods for increasing the specificity of ribozymes, and the preparation and use of ribozyme 
fragments in a hammerhead structure are described in U.S. Patent Nos. 5,144,019; 5,1 16,742; 
and 5,225,337 and Koizumi et al, Nucleic Acid Res. 17:7059-7071 (1989). Preparation and 
use of ribozyme fragments in a hairpin structure are described by Chowrira and Burke, 
Nucleic Acids Res. 20:2835 (1992). Ribozymes can also be made by rolling transcription as 
described in Daubendiek and Kool, Nat. Biotechnol. 15(3):273-277 (1997). 

The hybridizing region of the ribozyme may be modified or may be prepared as a 
branched structure as described in Horn and Urdea, Nucleic Acids Res. \1 .6959-61 (1989). 
The basic structure of the ribozymes may also be chemically altered in ways familiar to those 
skilled in the art, and chemically synthesized ribozymes can be administered as synthetic 
oligonucleotide derivatives modified by monomeric units. In a therapeutic context, liposome 
mediated delivery of ribozymes improves cellular uptake, as described in Birikh et al, Eur. J. 
Biochem. 245:1-16(1997). 

Ribozymes of the present invention also include RNA endoribonucleases (hereinafter 
"Cech-type ribozymes") such as the one which occurs naturally in Tetrahymena thermophila 
(known as the IVS, or L-19 IVS RNA) and which has been extensively described by Thomas 
Cech and collaborators (Zaug et al, Science 224:574-578 (1984); Zaug and Cech, Science 
231:470-475 (1986); Zaug et al, Nature, 324:429-433 (1986); W0 88/04300; Been and Cech, 
Cell 47 '.-207-216 (1986)). The Cech-type ribozymes have an eight base pair active site which 
hybridizes to a target RNA sequence whereafter cleavage of the target RNA takes place. The 
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invention encompasses those Cech-type ribozymes which target eight base-pair active site 
sequences that are present in a target gene. 

Ribozymes can be composed of modified oligonucleotides {e.g., for improved 
stability, targeting, etc.) and should be delivered to cells which express the target gene in 
vivo. A preferred method of delivery involves using a DNA construct "encoding" the 
ribozyme under the control of a strong constitutive pol III or pol II promoter, so that 
transfected cells will produce sufficient quantities of the ribozyme to destroy endogenous 
messages and inhibit translation. Because ribozymes, unlike antisense molecules, are 
catalytic, a lower intracellular concentration is required for efficiency. 

Using the nucleic acid sequences of the invention and methods known in the art, 
ribozymes are designed to specifically bind and cut the corresponding mRNA species. 
Ribozymes thus provide a means to inhibit the expression of any of the proteins encoded by 
the disclosed nucleic acids or their full-length genes. The full-length gene need not be known 
in order to design and use specific inhibitory ribozymes. In the case of a nucleic acid or 
cDNA of unknown function, ribozymes corresponding to that nucleotide sequence can be 
tested in vitro for efficacy in cleaving the target transcript. Those ribozymes that effect 
cleavage in vitro are further tested in vivo. The ribozyme can also be used to generate an 
animal model for a disease, as described in Birikh et al., Eur. J. Biochem. 245:1-16 (1997). 
An effective ribozyme is used to determine the function of the gene of interest by blocking its 
transcription and detecting a change in the cell. Where the gene is found to be a mediator in a 
disease, an effective ribozyme is designed and delivered in a gene therapy for blocking 
transcription and expression of the gene. 

Therapeutic and functional genomic applications of ribozymes begin with knowledge 
of a portion of the coding sequence of the gene to be inhibited. Thus, for many genes, a 
partial nucleic acid sequence provides adequate sequence for constructing an effective 
ribozyme. A target cleavage site is selected in the target sequence, and a ribozyme is 
constructed based on the 5' and 3' nucleotide sequences that flank the cleavage site. 
Retroviral vectors are engineered to express monomeric and multimeric hammerhead 
ribozymes targeting the mRNA of the target coding sequence. These monomeric and 
multimeric ribozymes are tested in vitro for an ability to cleave the target mRNA. A cell line 
is stably transduced with the retroviral vectors expressing the ribozymes, and the transduction 
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is confirmed by Northern blot analysis and reverse-transcription polymerase chain reaction 
(RT-PCR). The cells are screened for inactivation of the target mRNA by such indicators as 
reduction of expression of disease markers or reduction of the gene product of the target 
mRNA. 

Cells and Organisms 

Nucleic acid molecules that may be used in cell transformation or transfection can be 
any of the nucleic acid molecules of the present invention. Nucleic acid molecules of the 
present invention can be introduced into a cell or organism. In a preferred aspect, the cell is 
selected from the group consisting of cells that do not express VEGF, cells that express 
VEGF, or cells that express VEGF conditionally. In a more preferred aspect, the cell is a 
cancer cell, more preferably a cancer cell where VEGF is overexpressed relative to a non- 
transformed cell. 

A host cell strain can be chosen for its ability to modulate the expression of the 
inserted sequences, to process an expressed reporter gene in the desired fashion, or based on 
the expression levels of an endogenous or heterologous VEGF gene. Mammalian cell lines 
available as hosts for expression are known in the ait and include many immortalized cell 
lines available from the American Type Culture Collection (ATCC, Manassas, VA), such as 
HeLa cells, Chinese hamster ovary (CHO) cells, baby hamster kidney (BHK) cells and a 
number of other cell lines. Non-limiting examples of suitable mammalian host cell lines 
include those shown below in Table 1 . 



Table 1 : Mammalian Host Cell Lines 



Host Cell 


Origin 


Source 


HepG-2 


Human Liver Hepatoblastoma 


ATCC HB 8065 


CV-1 


African Green Monkey Kidney 


ATCC CCL 70 


LLC-MK 2 


Rhesus Monkey Kidney 


ATCC CCL 7 


3T3 


Mouse Embryo Fibroblasts 


ATCC CCL 92 


AVI 2-664 


Syrian Hamster 


ATCC CRL 9595 


HeLa 


Human Cervix Epitheloid 


ATCC CCL 2 


RPMI8226 


Human Myeloma 


ATCC CCL 155 


H4IIEC3 


Rat Hepatoma 


ATCC CCL 1600 
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C127T 


Mouse Fibroblast 


ATCC CCL 1616 


293 


Human Embryonal Kidney 


ATCC CRL 1573 


HS-Sultan 


Human Plasma Cell Plasmacytoma 


ATCC CCL 1484 


BHK-21 


Baby Hamster Kidney 


ATCC CCL 10 


CHO-K1 


Chinese Hamster Ovary 


ATCC CCL 61 



In a preferred aspect, cells of the present invention can be cells of an organism. In a 
more preferred aspect, the organism is a mammal. In a most preferred aspect, the mammal is 
a human. In another more preferred aspect, the organism is a non-human mammal, 
preferably a mouse, rat, or a chimpanzee. In one aspect of the present invention, cells can be 
pluripotent or differentiated. 

A nucleic acid of the present invention can be naturally occurring in the cell or can be 
introduced using techniques such as those described in the art. There are many methods for 
introducing transforming DNA segments into cells, but not all are suitable for delivering 
DNA to eukaryotic cells. Suitable methods include any method by which DNA can be 
introduced into a cell, such as by direct delivery of DNA, by desiccation/inhibition-mediated 
DNA uptake, by electroporation, by agitation with silicon carbide fibers, by acceleration of 
DNA coated particles, by chemical transfection, by lipofection or liposome-mediated 
transfection, by calcium chloride-mediated DNA uptake, etc. For example, without 
limitation, Lipofectamine® (Invitrogen Co., Carlsbad, CA) and Fugene® (Hoffmann-La 
Roche Inc., Nutley, NJ) can be used for transfection of nucleic acid molecules, such as 
constructs and siRNA, into several mammalian cells. Alternatively, in certain embodiments, 
acceleration methods are preferred and include, for example, microprojectile bombardment 
and the like. Within the scope of this invention, the transfected nucleic acids of the present 
invention may be expressed transciently or stably. Such transfected cells can be in a two- or 
three-dimensional cell culture system or in an organism. 

For example, without limitation, the construct may be an autonomously replicating 
construct, i.e., a construct that exists as an extrachromosomal entity, the replication of which 
is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a 
minichromosome, or an artificial chromosome. The construct may contain any means for 
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assuring self-replication. For autonomous replication, the construct may further comprise an 
origin of replication enabling the construct to replicate autonomously in the host cell in 
question. Alternatively, the construct may be one which, when introduced into the cell, is 
integrated into the genome and replicated together with the chromosome(s) into which it has 
been integrated. This integration may be the result of homologous or non-homologous 
recombination. 

Integration of a construct or nucleic acid into the genome by homologous 
recombination, regardless of the host being considered, relies on the nucleic acid sequence of 
the construct. Typically, the construct contains nucleic acid sequences for directing 
integration by homologous recombination into the genome of the host. These nucleic acid 
sequences enable the construct to be integrated into the host cell genome at a precise location 
or locations in one or more chromosomes. To increase the likelihood of integration at a 
precise location, there should be preferably two nucleic acid sequences that individually 
contain a sufficient number of nucleic acids, preferably 400 residues to 1500 residues, more 
preferably 800 residues to 1000 residues, which are highly homologous with the 
corresponding host cell target sequence. This enhances the probability of homologous 
recombination. These nucleic acid sequences may be any sequence that is homologous with 
a host cell target sequence and, furthermore, may or may not encode proteins. 

Stable expression is preferred for long-term, high-yield production of recombinant 
proteins. For example, cell lines that stably express a reporter gene can be transformed using 
expression constructs that can contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate construct. 
Following the introduction of the construct, cells can be allowed to grow for 1-2 days in an 
enriched medium before they are switched to a selective medium. The purpose of the 
selectable marker is to confer resistance to selection, and its presence allows growth and 
recovery of cells that successfully express the introduced construct. Resistant clones of 
stably transformed cells can be proliferated using tissue culture techniques appropriate to the 
cell type. See, for example, Animal Cell Culture, R.I. Freshney, ed., 1986. 

Any number of selection systems can be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler et al, 1 977. 
Cell vol. 11 :223-32) and adenine phosphoribosyltransferase (Lowy et al, 1980 Cell vol. 
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22:817-23.) genes which can be employed in tk~ or aprf cells, respectively. Also, 
antimetabolite, antibiotic, or herbicide resistance can be used as the basis for selection. For 
example, dhfr confers resistance to methotrexate (Wigler etal, 1980. Proc, Natl. Acad. Set 
vol. 77:3567-70), npt confers resistance to the aminoglycosides, neomycin and G-418 
(Colbere-Garapin et al, 1981. J. Mol. Biol, vol.150: 1-14), and ah and pat confer resistance 
to chlorsulfuron and phosphinotricin acetyltransferase, respectively. Additional selectable 
genes have been described. For example, trpB allows cells to utilize indole in place of 
tryptophan, and hisD allows cells to utilize histinol in place of histidine (Hartman & 
Mulligan, 1988. Proc. Natl Acad. Sci. vol. 85:8047-51). Visible markers such as 
anthocyanins, fS-glucuronidase and its substrate GUS, and luciferase and its substrate 
luciferin, can be used to identify transformants and to quantify the amount of transient or 
stable protein expression attributable to a specific construct system (Rhodes et al, 1995. 
Methods Mol Biol vol. 55:121-131). 

Although the presence of marker gene expression suggests that a reporter gene is also 
present, its presence and expression may need to be confirmed. For example, if a sequence 
encoding a reporter gene is inserted within a marker gene sequence, transformed cells 
containing sequences that encode a reporter gene can be identified by the absence of marker 
gene function. Alternatively, a marker gene can be placed in tandem with a sequence 
encoding a reporter gene under the control of a single promoter. Expression of the marker 
gene in response to induction or selection usually indicates expression of a reporter gene. 

Alternatively, host cells which contain a reporter gene and which express a reporter 
gene e can be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations and 
protein bioassay or immunoassay techniques that include membrane, solution, or chip-based 
technologies for the detection and/or quantification of nucleic acid or protein. For example, 
the presence of a reporter gene can be detected by DNA-DNA or DNA-RNA hybridization or 
amplification using probes or fragments or fragments of polynucleotides encoding a reporter 
gene. Nucleic acid amplification-based assays involve the use of oligonucleotides selected 
from sequences encoding a reporter gene to detect transformants that contain a reporter gene. 
Screening Methods of the Present Invention 

Compound 
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The present invention includes methods for screening compounds capable of 
modulating gene expression. Any compound can be screened in an assay of the present 
invention. 

In an embodiment, a compound includes a nucleic acid or a non-nucleic acid, such as 
a polypeptide or a non-peptide therapeutic agent. In a preferred embodiment, a nucleic acid 
can be a polynucleotide, a polynucleotide analog, a nucleotide, or a nucleotide analog. In a 
more preferred embodiment, a compound can be an antisense oligonucleotide, which are 
nucleotide sequences complementary to a specific DNA or RNA sequence of the present 
invention. Preferably, an antisense oligonucleotide is at least 1 1 nucleotides in length, but 
can be at least 12, 1 5, 20, 25, 30, 35, 40, 45, or 50 or more nucleotides long. Longer 
sequences also can be used. Antisense oligonucleotides can be deoxyribonucleotides, 
ribonucleotides, or a combination of both. 

Nucleic acid molecules, including antisense oligonucleotide molecules, can be 
provided in a DNA construct and introduced into a cell. Nucleic acid molecules can be anti- 
sense or sense and double- or single-stranded. In a preferred embodiment, nucleic acid 
molecules can be interfering RNA (RNAi) or microRNA (miRNA). In a preferred 
embodiment, the dsRNA is 20-25 residues in length, termed small interfering RNAs 
(siRNA). 

Oligonucleotides can be synthesized manually or by an automated synthesizer, by 
covalently linking the 5' end of one nucleotide with the 3' end of another nucleotide with non- 
phosphodiester internucleotide linkages such alkylphosphonates, phosphorothioates, 
phosphorodithioates, alkylphosphonothioates, alkylphosphonates, phosphoramidates, 
phosphate esters, carbamates, acetamidate, carboxymethyl esters, carbonates, and phosphate 
triesters. See Brown, 1994 Meth. Mol. Biol. vol. 20:1-8; Sonveaux, 1994. Meth. Mol. Biol 
Vol. 26:1-72; and Uhlmann et al, 1990. Chem. Rev. vol. 90:543-583. Salts, esters, and other 
pharmaceutically acceptable forms of such compounds are also encompassed. 

In a preferred embodiment, a compound can be a peptide, polypeptide, polypeptide 
analog, amino acid, or amino acid analog. Such a compound can be synthesized manually or 
by an automated synthesizer. 

A compound can be a member of a library of compounds. In a specific embodiment, 
the compound is selected from a combinatorial library of compounds comprising peptoids; 
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random biooligomers; diversomers such as hydantoins, benzodiazepines and dipeptides; 
vinylogous polypeptides; nonpeptidal peptidomimetics; oligocarbamates; peptidyl 
phosphonates; peptide nucleic acid libraries; antibody libraries; carbohydrate libraries; and 
small organic molecule libraries. In a preferred embodiment, the small organic molecule 
libraries are libraries of benzodiazepines, isoprenoids, thiazolidinones, metathiazanones, 
pyrrolidines, morpholino compounds, or diazepindiones. 

In another embodiment, a compound can have a molecular weight less than about 
10,000 grams per mole, less than about 5,000 grams per mole, less than about 1,000 grams 
per mole, less than about 500 grams per mole, less than about 100 grams per mole, and salts, 
esters, and other pharmaceutically acceptable forms of such compounds. 

Compounds can be evaluated comprehensively for cytotoxicity. The cytotoxic effects 
of the compounds can be studied using cell lines, including for example 293 (kidney), HuH7 
(liver), and Hela cells over about 4, 10, 16, 24, 36 or 72-hour periods. In addition, a number 
of primary cells such as normal fibroblasts and peripheral blood mononuclear cells (PBMCs) 
can be grown in the presence of compounds at various concentrations for about 4 days. Fresh 
compound can be added every other day to maintain a constant level of exposure with time. 
The effect of each compound on cell-proliferation can be determined by CellTiter 96® 
AQueous One Solution Cell Proliferation Assay (Promega Co, Madison, WI) and [ 3 H]- 
Thymidine incorporation. Treatment of some cells with some of the compounds may have 
cytostatic effects. A selective index (ratios of CC50 in cytotoxicity assays to the EC50 in 
ELISA or FACS or the reporter gene assays) for each compound can be calculated for all of 
the UTR-reporters and protein inhibition assays. Compounds exhibiting substantial selective 
indices can be of interest and can be analyzed further in the functional assays. 

The structure of a compound can be determined by any well-known method such as 
mass spectroscopy, NMR, vibrational spectroscopy, or X-ray crystallography as part of a 
method of the present invention. 

Compounds can be pharmacologic agents already known in the art or can be 
compounds previously unknown to have any pharmacological activity. The compounds can 
be naturally occurring or designed in the laboratory. They can be isolated from 
microorganisms, animals, or plants, and can be produced recombinantly, or synthesized by 
chemical methods known in the art. If desired, compounds can be obtained using any of the 
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numerous combinatorial library methods known in the art, including but not limited to, 
biological libraries, spatially addressable parallel solid phase or solution phase libraries, 
synthetic library methods requiring deconvolution, the "one-bead one-compound" library 
method, and synthetic library methods using affinity chromatography selection. Methods for 
the synthesis of molecular libraries are well known in the art (see, for example, DeWitt et al., 
Proc. Natl. Acad. Sci. U.S.A. 90, 6909, 1993; Erb et al. Proc. Natl. Acad. Sci. U.S.A. 91, 
1 1422, 1994; Zuckermann et al, J. Med. Chem. 37, 2678, 1994; Cho et al., Science 261, 
1303, 1993; Carell et al., Angew. Chem. Int. Ed. Engl. 33, 2059, 1994; Carell et al., Angew. 
Chem. Int. Ed. Engl. 33, 2061; Gallop et al., J. Med. Chem. 37, 1233, 1994). Libraries of 
compounds can be presented in solution (see, e.g., Houghten, BioTechniques 13, 412-421, 

1992) , or on beads (Lam, Nature 354, 82-84, 1991), chips (Fodor, Nature 364, 555-556, 

1993) , bacteria or spores (Ladner, U.S. Pat. No. 5,223,409), plasmids (Cull et al., Proc. Natl. 
Acad. Sci. U.S.A. 89, 1865-1869, 1992), or phage (Scott & Smith, Science 249, 386-390, 
1990; Devlin, Science 249, 404-406, 1990); Cwirla et al., Proc. Natl. Acad. Sci. 97, 6378- 
6382, 1990; Felici, J. Mol. Biol. 222, 301-310, 1991; and Ladner, U.S. Pat. No. 5,223,409). 

Methods of the present invention for screening compounds can select for compounds 
capable of modulating gene expression, which are capable of directly binding to a ribonucleic 
acid molecule transcribed from a target gene. In a preferred embodiment, a compound 
identified in accordance with the methods of the present invention may be capable of binding 
to one or more rrara-acting factors (such as, but not limited to, proteins) that modulate UTR- 
dependent expression of a target gene. In another preferred embodiment, a compound 
identified in accordance with the methods of invention may disrupt an interaction between 
the5'UTRandthe3'UTR. 

Compounds can be tested using in vitro assays {e.g., cell-free assays) or in vivo assays 
(e.g., cell-based assays) well known to one of skill in the art or as provided in the present 
invention. A compound that modulates expression of a target gene can be determined from 
the methods provided in the present invention. A UTR of the present invention includes 
UTRs capable of modulating gene expression in the presence, in the absence, or in the 
presence and absence of a compound. In a preferred embodiment, the effect of a compound 
on the expression of one or more genes can be determined utilizing assays well known to one 
of skill in the art or provided by the present invention to assess the specificity of a particular 
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compound's effect on the UTR-dependent expression of a target gene. In a more preferred 
embodiment, a compound has specificity for a plurality of genes. In another more preferred 
embodiment, a compound identified utilizing the methods of the present invention is capable 
of specifically effect the expression of only one gene or, alternatively, a group of genes 
within the same signaling pathway. Compounds identified in the assays of the present 
invention can be tested for biological activity using host cells containing or engineered to 
contain the target RNA element involved in UTR-dependent gene expression coupled to a 
functional readout system. 
Screening assays 

The present invention includes and provides for assays capable of screening for 
compounds capable of modulating gene expression. In a preferred aspect of the present 
invention, an assay is an in vitro assay. In another aspect of the present invention, an assay is 
an in vivo assay. In another preferred aspect of the present invention, an assay measures 
translation. In a preferred aspect of the present invention, the assay includes a nucleic acid 
molecule of the present invention or a construct of the present invention. A nucleic acid 
molecule or construct of the present invention include, without limitation, SEQ ID NOs: 3- 
12, or a sequence that differs from any of the residues in SEQ ID NOs: 3-12 in that the 
nucleic acid sequence has been deleted, substituted, or added in a manner that does not alter 
the function. The present invention also provides fragments and complements of all the 
nucleic acid molecules of the present invention. 

In one embodiment of the present invention, the activity or expression of a reporter 
gene is modulated. Modulated means increased or decreased during any point before, after, 
or during translation. In a preferred embodiment, activity or expression of a reporter gene is 
modulated during translation. For example, inhibition of translation of the reporter gene 
would modulate expression. In an alternative example, expression level of a reporter gene is 
modulated if the steady-state level of the expressed protein decreased even though translation 
was not inhibited. For example, a change in the half-life of a mRNA can modulate 
expression. 

In an alternative embodiment, modulated activity or expression of a reporter gene 
means increased or decreased during any point before or during translation. 
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In a more preferred aspect, the activity or expression of a reporter gene or a target 
gene is modulated by greater than 30%, 40%, 50%, 60%, 70%, 80% or 90% in the presence 
of a compound. In a highly preferred aspect, more of an effect is observed in VEGF- 
dependent cancer cells. 

In a most preferred aspect, the activity or expression of a reporter gene is modulated 
without altering the activity of a control gene for general, indiscriminate translation activity. 
As used herein, indiscriminate translation activity refers to modulation in translation levels or 
activity that is random or unsystematic. One assay for modulation in general, indiscriminate 
translation activity uses a general translational inhibitor, for example puromycin, which is an 
inhibitor that causes release of nascent peptide and mRNA from ribosomes. 

Expression of a reporter gene can be detected with, for example, techniques know in 
the art. Translation or transcription of a reporter gene can be detected in vitro or in vivo. In 
detection assays, either the compound or the reporter gene can comprise a detectable label, 
such as a fluorescent, radioisotopic, chemiluminescent, or enzymatic label, such as 
horseradish peroxidase, alkaline phosphatase, or luciferase. 

High-throughput screening can be done by exposing nucleic acid molecules of the 
present invention to a library of compounds and detecting gene expression with assays known 
in the art, including, for example without limitation, those described above. In one 
embodiment of the present invention, cancer cells, such as HeLa cells, expressing a nucleic 
acid molecule of the present invention are treated with a library of compounds. Percent 
inhibition of reporter gene activity can be obtained with all the library compounds can be 
analyzed using, for example without limitation, a scattergram generated by SpotFire® 
(SpotFire, Inc., Somerville, MA). The high-throughput screen can be followed by subsequent 
selectivity screens. In a preferred embodiment, a subsequent selectivity screen can include 
detection of reporter gene expression in cells expressing, for example, a reporter gene linked 
to a PTCRE or flanked by a 5' and 3' UTR of the same gene, either of which contains a 
PTCRE or a NeRP or both of the present invention. In an alternative preferred embodiment, 
a subsequent selectivity screen can include detection of reporter gene expression in cells in 
the presence of a various concentrations of compounds. 

Once a compound has been identified to modulate UTR-dependent expression of a 
target gene and preferably, the structure of the compound has been identified by the methods 
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described in the present invention and well known in the art, the compounds are tested for 
biological activity in further assays and/or animal models. Further, a lead compound may be 
used to design congeners or analogs. 

A wide variety of labels and conjugation techniques are known by those skilled in the 
art and can be used in various nucleic acid and amino acid assays. Means for producing 
labeled hybridization or PCR probes for detecting sequences related to PTCREs or NeRPs of 
the present invention include oligolabeling, nick translation, end-labeling, or PCR 
amplification using a labeled nucleotide. Suitable reporter molecules or labels which can be 
used for ease of detection include radionuclides, enzymes, and fluorescent, 
chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

In vitro 

The present invention includes and provides for assays capable of screening for 
compounds capable of modulating gene expression. In a preferred aspect of the present 
invention, an assay is an in vitro assay. In a preferred aspect of the present invention, an in 
vitro assay that measures translation. In a preferred aspect of the present invention the in 
vitro assay includes a nucleic acid molecule of the present invention or a construct of the 
present invention. 

In one embodiment, a reporter gene of the present invention can encode a fusion 
protein or a fusion protein comprising a domain that allows the expressed reporter gene to be 
bound to a solid support. For example, glutathione- S-transferase fusion proteins can be 
adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
derivatized microtiter plates, which are then combined with the compound or the compound 
and the non-adsorbed expressed reporter gene; the mixture is then incubated under conditions 
conducive to complex formation (e.g., at physiological conditions for salt and pH). Following 
incubation, the beads or microtiter plate wells are washed to remove any unbound 
components. Binding of the interactants can be determined either directly or indirectly, as 
described above. Alternatively, the complexes can be dissociated from the solid support 
before binding is determined. 

Other techniques for immobilizing an expressed reporter gene or compound on a solid 
support also can be used in the screening assays of the invention. For example, either an 
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expressed reporter gene or compound can be immobilized utilizing conjugation of biotin and 
streptavidin. Biotinylated expressed reporter genes or compounds can be prepared from 
biotin-NHS(N-hydroxysuccinimide) using techniques well known in the art (e.g., 
biotinylation kit, Pierce Chemicals, Rockford, EL) and immobilized in the wells of 
streptavidin-coated 96 well plates (Pierce Chemicals, Rockford, IL). Alternatively, 
antibodies which specifically bind to an expressed reporter gene or compound, but which do 
not interfere with a desired binding or catalytic site, can be derivatized to the wells of the 
plate. Unbound target or protein can be trapped in the wells by antibody conjugation. 

Methods for detecting such complexes, in addition to those described above for the 
GST-immobilized complexes, include immunodetection of complexes using antibodies which 
specifically bind to an expressed reporter gene or compound, enzyme-linked assays which 
rely on detecting an activity of an expressed reporter gene, electrophoretic mobility shift 
assays (EMSA), and SDS gel electrophoresis under reducing or non-reducing conditions. 

In one embodiment, translation of a reporter gene in vitro can be detected following 
the use of a reticulocyte lysate translation system, for example the TnT® Coupled 
Reticulocyte Lysate System (Promega Co., Madison, WI). In this aspect, for example, 
without limitation, RNA (100 ng) can be translated at 30° C in reaction mixtures containing 
70% reticulocyte lysate, 20 uM amino acids and RNase inhibitor (0.8 units/ul). After 45 
minutes of incubation, 20 |il of Luclite can be added and luminescence can be read on the 
View-Lux. Different concentrations of compounds can be added to the reaction in a final 
DMSO concentration of 2% and the EC 5 o values calculated. Puromycin can be used as 
control for general indiscriminate translation inhibition. In vitro transcripts encoding a 
reporter gene linked to specific UTRs from target genes, including GAPDH, XIAP, TNF-ct, 
and HIF-la, can also be used. 

To study the influence of cell-type specific factors, capped RNA can be translated in 
translation extracts prepared from specialized cells or cancer cell lines, for example without 
limitation, HT1080 cells (a human fibrosarcoma cell line). Briefly, the cells can be washed 
with PBS and swollen in hypotonic buffer (10 mM Hepes, pH 7.4, 1 5 mM KC1, 1.5 mM 
Mg(OAc) 2 , 2 mM DTT and 0.5 mM Pefabloc (Pentapharm Ltd. Co., Switzerland) for 5 
minutes on ice. The cells can be lysed using a Dounce homogenizer (100 strokes), and the 
extracts can be spun for 10 minutes at 10,000 x g. These clarified extracts can then be flash- 
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frozen in liquid nitrogen and stored in aliquots at -70°C. The translation reaction can be 
capped RNA (50 ng) in a reaction mixture containing 60% clarified translation extract, 15 
pM total amino acids, 0.2 mg/ml Creatine phosho-kinase, which are all in IX translation 
buffer (15 mM Hepes, pH 7.4, 85 mM KOAc, 1.5 mM Mg(OAc) 2 , 0.5 mM ATP, 0.075 mM 
GTP, 18 mM creatine diphosphate and 1.5 mM DTT). After incubation of the translation 
reaction for 90 min at 37°C, activity of the protein encoded by the reporter gene can be 
detected. For activity of luciferase, encoded by the luciferase gene serving as the reporter 
gene, addition of 20 pi of LucLite® (Packard instrument Co., Inc., Meriden, CT) can be used. 

Capped and uncapped RNAs can be synthesized in vitro using the T7 polymerase 
transcription kits (Ambion Inc., Austin, TX). Capped RNAs from a variety of nucleic acid 
molecules of the present invention, including without limitation, constructs with VEGF 
linked to a PTCRE of the present invention, constructs with a reporter gene linked only to a 
vector, constructs with GAPDH linked to a PTCRE, constructs with a HIF-la linked to a 
PTCRE, and constructs with a HIF-la not linked to a PTCRE, can be used in a similar in 
vitro system to study the influence of cell-type specific factors on translation. 

In vivo 

The present invention includes and provides for assays capable of screening for 
compounds capable of modulating gene expression. In a preferred aspect of the present 
invention, an assay is an in vivo assay. A preferred aspect of the present invention is an assay 
that measures translation, In a preferred embodiment of the present invention, an in vivo 
assay includes a nucleic acid molecule of the present invention or a construct of the present 
invention and can include the use of a cell or a cell or tissue within an organism. In a more 
preferred embodiment, an in vivo assay includes a nucleic acid molecule of the present 
invention present in a cell or a cell or tissue within an organism. 

In another embodiment, in vivo translation of a reporter gene can be detected. In a 
preferred embodiment, a reporter gene is transfected into a cancer cell obtained from a cell 
line available at the (American Type Culture Collection (ATCC), Manassas, VA), for 
example HeLa, MCF-7, and COS-7, BT474. In a more preferred embodiment, a cancer cell 
has an altered genome relative to a similarly derived normal, primary cell, and the 
mammalian cancer cell proliferates under conditions where such a primary cell would not. 
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Screening for compounds that modulate reporter gene expression can be carried out in 
an intact cell. Any cell that comprises a reporter gene can be used in a cell-based assay 
system. A reporter gene can be naturally occurring in the cell or can be introduced using 
techniques such as those described above (see Cells and Organisms). In one embodiment, a 
cell line is chosen based on its expression levels of naturally occurring VEGF. Modulation of 
reporter gene expression by a compound can be determined in vitro as described above or in 
vivo as described below. 

To detect expression of endogenous protein, a variety of protocols for detecting and 
measuring the expression of a reporter gene are known in the art. For example, Enzyme- 
Linked Immunosorbent Assays (ELISAs), western blots using either polyclonal or 
monoclonal antibodies specific for an expressed reporter gene, Fluorescence-Activated Cell 
Sorter (FACS), electrophoretic mobility shift assays (EMSA), or radioimmunoassay (RIA) 
can be performed to quantify the level of specific proteins in lysates or media derived from 
cells treated with the compounds. In a preferred embodiment, a phenotypic or physiological 
readout can be used to assess UTR-dependent activity of the target RNA in the presence and 
absence of the lead compound. 

A wide variety of labels and conjugation techniques are known by those skilled in the 
art and can be used in various nucleic acid and amino acid assays. Means for producing 
labeled hybridization or PCR probes for detecting sequences related to polynucleotides 
having a PTCRE or a NeRP of the present invention include oligolabeling, nick translation, 
end-labeling, or PCR amplification using a labeled nucleotide. Alternatively, sequences 
having a PTCRE or a NeRP of the present invention can be cloned into a vector for the 
production of amRNA probe. Such vectors are known in the art, are commercially available, 
and can be used to synthesize RNA probes in vitro by addition of labeled nucleotides and an 
appropriate RNA polymerase such as T7, T3, or SP6. These procedures can be conducted 
using a variety of commercially available kits (Amersham Biosciences Inc., Piscataway, NJ; 
and Promega Co, Madison, WI). Suitable reporter molecules or labels which can be used for 
ease of detection include radionucleotides, enzymes, and fluorescent, chemilurninescent, or 
chromogenic agents, as well as substrates, cofactors; inhibitors, magnetic particles, and the 
like. 
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Therapeutic Uses 

The present invention also provides for methods for treating, preventing or 
ameliorating one or more symptoms of a disease or disorder associated with the aberrant 
expression of a target gene, said method comprising administering to a subject in need 
thereof a therapeutically or prophylactically effective amount of a compound, or a 
pharmaceutically acceptable salt thereof, identified according to the methods described 
herein. In one embodiment of the present invention, a target gene is aberrantly expressed. A 
target gene can be aberrantly overexpressed or expressed at an aberrantly low level. In 
particular, the invention provides for a method of treating or preventing a disease or disorder 
or ameliorating a symptom thereof, said method comprising administering to a subject in 
need thereof an effective amount of a compound, or a pharmaceutically acceptable salt 
thereof, identified according to the methods described herein, wherein said effective amount 
increases the expression of a target gene beneficial in the treatment or prevention of said 
disease or disorder. The invention also provides for a method of treating or preventing a 
disease or disorder or ameliorating a symptom thereof, said method comprising administering 
to a subject in need thereof an effective amount of a compound, or a pharmaceutically 
acceptable salt thereof, identified according to the methods described herein, wherein said 
effective amount decreases the expression of a target gene whose expression is associated 
with or has been linked to the onset, development, progression or severity of said disease or 
disorder. In a specific embodiment, the disease or disorder is a proliferative disorder, an 
inflammatory disorder, an infectious disease, a genetic disorder, an autoimmune disorder, a 
cardiovascular disease, or a central nervous system disorder. In an embodiment wherein the 
disease or disorder is an infectious disease, the infectious disease can be caused by a fungal 
infection, a bacterial infection, a viral infection, or an infection caused by another type of 
pathogen. 

In addition, the invention provides pharmaceutical compositions that can be 
administered to a patient to achieve a therapeutic effect. Pharmaceutical compositions of the 
invention can comprise, for example, ribozymes or antisense oligonucleotides, antibodies that 
specifically bind to a PTCRE or NeRP of the present invention, or mimetics, activators, 
inhibitors of PTCRE or NeRP activity, or a nucleic acid molecule of the present invention. 
The compositions can be administered alone or in combination with at least one other agent, 
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such as stabilizing compound, which can be administered in any sterile, biocompatible 
pharmaceutical carrier, including, but not limited to, saline, buffered saline, dextrose, and 
water. The compositions can be administered to a patient alone, or in combination with other 
agents, drugs or hormones. 

In addition to the active ingredients, these pharmaceutical compositions can contain 
suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which 
facilitate processing of the active compounds into preparations which can be used 
pharmaceutically. Pharmaceutical compositions of the invention can be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, 
intranasal, parenteral, topical, sublingual, or rectal means. Pharmaceutical compositions for 
oral administration can be formulated using pharmaceutically acceptable carriers well known 
in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical 
compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, 
slurries, suspensions, and the like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combination of 
active compounds with solid excipient, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain 
tablets or dragee cores. Suitable excipients are carbohydrate or protein fillers, such as sugars, 
including lactose, sucrose, mannitol, or sorbitol; starch from corn, wheat, rice, potato, or 
other plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium 
carboxymethylcellulose; gums including arabic and tragacanth; and proteins such as gelatin 
and collagen. If desired, disintegrating or solubilizing agents can be added, such as the cross- 
linked polyvinyl pyrrolidone, agar, alginic acid, or a salt thereof, such as sodium alginate. 

Pharmaceutical preparations that can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or 
sorbitol. Push-fit capsules can contain active ingredients mixed with fillers or binders, such as 
lactose or starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. 
In soft capsules, the active compounds can be dissolved or suspended in suitable liquids, such 
as fatty oils, liquid, or liquid polyethylene glycol with or without stabilizers. 



47 



WO 2005/118857 



PCT/US2004/020751 



Pharmaceutical formulations suitable for parenteral administration can be formulated 
in aqueous solutions, preferably in physiologically compatible buffers such as Hanks' 
solution, Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions 
can contain substances that increase the viscosity of the suspension, such as sodium 
carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the active 
compounds can be prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as 
ethyl oleate or triglycerides, or liposomes. Non- lipid polycationic amino polymers also can be 
used for delivery. Optionally, the suspension also can contain suitable stabilizers or agents 
that increase the solubility of the compounds to allow for the preparation of highly 
concentrated solutions. For topical or nasal administration, penetrants appropriate to the 
particular barrier to be permeated are used in the formulation. Such penetrants are generally 
known in the art. 

The pharmaceutical compositions of the present invention can be manufactured in a 
manner that is known in the art, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or 
lyophilizing processes. The pharmaceutical composition can be provided as a salt and can be 
formed with many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, 
tartaric, malic, succinic, etc. Salts tend to be more soluble in aqueous or other protonic 
solvents than are the corresponding free base forms. In other cases, the preferred preparation 
can be a lyophilized powder which can contain any or all of the following: 1-50 mM 
histidine, 0.1%-2% sucrose, and 2-7% mannitol, at apH range of 4.5 to 5.5, that is combined 
with buffer prior to use. Further details on techniques for formulation and administration can 
be found in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing 
Co., Easton, Pa.). After pharmaceutical compositions have been prepared, they can be placed 
in an appropriate container and labeled for treatment of an indicated condition. Such labeling 
would include amount, frequency, and method of administration. 

Determination of a Therapeutically Effective Dose 

A therapeutically effective dose refers to that amount of active ingredient that 
increases or decreases reporter gene activity relative to reporter gene activity that occurs in 
the absence of the therapeutically effective dose. For any compound, the therapeutically 
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effective dose can be estimated initially either in cell culture assays or in animal models, 
usually mice, rabbits, dog, or pigs. The animal model also can be used to determine the 
appropriate concentration range and route of administration. Such information can then be 
used to determine useful doses and routes for administration in humans. 

Therapeutic efficacy and toxicity, e.g., ED 50 (the dose therapeutically effective in 
50% of the population) and LD 5 o (the dose lethal to 50% of the population), can be 
determined by standard pharmaceutical procedures in cell cultures or experimental animals. 
The dose ratio of toxic to therapeutic effects is the therapeutic index, and it can be expressed 
as the ratio, LD50/ED50. 

Pharmaceutical compositions that exhibit large therapeutic indices are preferred. The 
data obtained from cell culture assays and animal studies is used in formulating a range of 
dosage for human use. The dosage contained in such compositions is preferably within a 
range of circulating concentrations that include the ED 50 with little or no toxicity. The dosage 
varies within this range depending upon the dosage form employed, sensitivity of the patient, 
and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to 
the subject that requires treatment. Dosage and administration are adjusted to provide 
sufficient levels of the active ingredient or to maintain the desired effect. Factors that can be 
taken into account include the severity of the disease state, general health of the subject, age, 
weight, and gender of the subject, diet, time and frequency of administration, drug 
combination(s), reaction sensitivities, and tolerance/response to therapy. Long-acting 
pharmaceutical compositions can be administered every 3 to 4 days, every week, or once 
every two weeks depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts can vary from 0.1 to 100,000 micrograms, up to a total dose 
of about 1 g, depending upon the route of administration. Guidance as to particular dosages 
and methods of delivery is provided in the literature and generally available to practitioners in 
the art. Those skilled in the art will employ different formulations for nucleotides than for 
proteins or their inhibitors. Similarly, delivery of polynucleotides or polypeptides will be 
specific to particular cells, conditions, locations, etc. 

If the reagent is a single-chain antibody, polynucleotides encoding the antibody can 
be constructed and introduced into a cell either ex vivo or in vivo using well-established 
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techniques including, but not limited to, transferrin-polycation-mediated DNA transfer, 
transfection with naked or encapsulated nucleic acids, liposome-mediated cellular fusion, 
intracellular transportation of DNA-coated latex beads, protoplast fusion, viral infection, 
electroporation, "gene gun," and DEAE- or calcium phosphate-mediated transfection. 

Effective in vivo dosages of an antibody are in the range of about 5 ug to about 50 
ug/kg, about 50 ug to about 5 mg/kg, about 100 ug to about 500 ug/kg of patient body 
weight, and about 200 to about 250 ug/kg of patient body weight. For administration of 
polynucleotides encoding single-chain antibodies, effective in vivo dosages are in the range 
of about 100 ng to about 200 ng, 500 ng to about 50 mg, about 1 ug to about 2 mg, about 5 
ug to about 500 ug, and about 20 ug to about 100 ug of DNA. 

If the expression product is mRNA, the reagent is preferably an antisense 
oligonucleotide or a ribozyme. Polynucleotides that express antisense oligonucleotides or 
ribozymes can be introduced into cells by a variety of methods, as described above. 

Preferably, a reagent reduces expression of a reporter gene or the activity of a reporter 
gene by at least about 10, preferably about 50, more preferably about 75, 90, or 100% relative 
to the absence of the reagent. The effectiveness of the mechanism chosen to decrease the 
level of expression of a reporter gene or the activity of a reporter gene can be assessed using 
methods well known in the art, such as hybridization of nucleotide probes to reporter gene- 
specific mRNA, quantitative RT-PCR, immunologic detection of an expressed reporter gene, 
or measurement of activity from an expressed reporter gene. 

In any of the embodiments described above, any of the pharmaceutical compositions 
of the invention can be administered in combination with other appropriate therapeutic 
agents. Selection of the appropriate agents for use in combination therapy can be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The 
combination of therapeutic agents can act synergistically to effect the treatment or prevention 
of the various disorders described above. Using this approach, one may be able to achieve 
therapeutic efficacy with lower dosages of each agent, thus reducing the potential for adverse 
side effects. 

Any of the therapeutic methods described above can be applied to any subject in need 
of such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, 
monkeys, and most preferably, humans. 
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Administration of a Therapeutically Effective Dose 

A reagent which affects translation can be administered to a human cell, either in vitro 
or in vivo, to specifically reduce translational activity of a specific gene. In a preferred 
embodiment, the reagent preferably binds to a 5' UTR of a gene. In an alternate 
embodiment, the present invention the reagent preferably binds to a PTCRE or NeRP of the 
present invention. In a preferred embodiment, the reagent is a compound. For treatment of 
human cells ex vivo, an antibody can be added to a preparation of stem cells which have been 
removed from the body. The cells can then be replaced in the same or another human body, 
with or without clonal propagation, as is known in the art. 

In one embodiment, the reagent is delivered using a liposome. Preferably, the 
liposome is stable in the animal into which it has been administered for at least about 30 
minutes, more preferably for at least about 1 hour, and even more preferably for at least about 
24 hours. A liposome comprises a lipid composition that is capable of targeting a reagent, 
particularly a polynucleotide, to a particular site in an animal, such as a human. Preferably, 
the lipid composition of the liposome is capable of targeting to a specific organ of an animal, 
such as the lung, liver, spleen, heart brain, lymph nodes, and skin. 

A liposome useful in the present invention comprises a lipid composition that is 
capable of fusing with the plasma membrane of the targeted cell to deliver its contents to the 
cell. Preferably, the transfection efficiency of a liposome is about 0.5 ug of DNA per 16 
nmole of liposome delivered to about 10 6 cells, more preferably about 1.0 jag of DNA per 16 
mnole of liposome delivered to about 10 6 cells, and even more preferably about 2.0 ug of 
DNA per 16 nmol of liposome delivered to about 10 6 cells. Preferably, a liposome is between 
about 100 and 500 nm, more preferably between about 150 and 450 nm, and even more 
preferably between about 200 and 400 nm in diameter. 

Suitable liposomes for use in the present invention include those liposomes standardly 
used in, for example, gene delivery methods known to those of skill in the art. More preferred 
liposomes include liposomes having a polycationic lipid composition and/or liposomes 
having a cholesterol backbone conjugated to polyethylene glycol. Optionally, a liposome 
comprises a compound capable of targeting the liposome to a particular cell type, such as a 
cell-specific ligand exposed on the outer surface of the liposome. 
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Complexing a liposome with a reagent such as an antisense oligonucleotide or 
ribozyme can be achieved using methods that are standard in the art (see, for example, U.S. 
Pat. No. 5,705,151). Preferably, from about 0.1 ugto about 10 ug of polynucleotide is 
combined with about 8 nmol of liposomes, more preferably from about 0.5 jag to about 5 ug 
of polynucleotides are combined with about 8 nmol liposomes, and even more preferably 
about 1.0 ug of polynucleotides is combined with about 8 nmol liposomes. 

In another embodiment, antibodies can be delivered to specific tissues in vivo using 
receptor-mediated targeted delivery. Receptor-mediated DNA delivery techniques are taught 
in, for example, Findeis et al. Trends in Biotechnol. 11, 202-05 (1993); Chiou et al., Gene 
Therapeutics: Methods And Applications Of Direct Gene Transfer (J. A. Wolff, ed.) (1994); 
Wu & Wu, J. Biol. Chem. 263, 621-24 (1988); Wu et al., J. Biol. Chem. 269, 542-46 (1994); 
Zenke et al., Proc. Natl. Acad. Sci. U.S.A. 87, 3655-59 (1990); Wu et al., J. Biol. Chem. 266, 
338-42 (1991). 

Diagnostic Methods 

Agents of the present invention can also be used in diagnostic assays for detecting 
diseases and abnormalities or susceptibility to diseases and abnormalities related to the 
presence of mutations in the nucleic acid sequences that encode a PTCRE or NeRP of the 
present invention. For example, differences can be determined between the cDNA or 
genomic sequence encoding a PTCRE or NeRP in individuals afflicted with a disease and in 
normal individuals. If a mutation is observed in some or all of the afflicted individuals but not 
in normal individuals, then the mutation is likely to be the causative agent of the disease. 

For example, the direct DNA sequencing method can reveal sequence differences 
between a reference gene and a gene having mutations. In addition, cloned DNA segments 
can be employed as probes to detect specific DNA segments. The sensitivity of this method 
is greatly enhanced when combined with PCR. For example, a sequencing primer can be 
used with a double-stranded PCR product or a single-stranded template molecule generated 
by a modified PCR. The sequence determination is performed by conventional procedures 
using radiolabeled nucleotides or by automatic sequencing procedures using fluorescent tags. 

Moreover, for example, genetic testing based on DNA sequence differences can be 
carried out by detection of alteration in electrophoretic mobility of DNA fragments in gels 
with or without denaturing agents. Small sequence deletions and insertions can be visualized, 
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for example, by high-resolution gel electrophoresis. DNA fragments of different sequences 
can be distinguished on denaturing formamide gradient gels in which the mobilities of 
different DNA fragments are retarded in the gel at different positions according to their 
specific melting or partial melting temperatures (see, e.g., Myers et al., Science 230, 1242, 
1985). Sequence changes at specific locations can also be revealed by nuclease protection 
assays, such as RNase and SI protection or the chemical cleavage method (e.g., Cotton et al., 
Proc. Natl. Acad. Sci. USA 85, 4397-4401, 1985). Thus, the detection of a specific DNA 
sequence can be performed by methods such as hybridization, RNase protection, chemical 
cleavage, direct DNA sequencing or the use of restriction enzymes and Southern blotting of 
genomic DNA. In addition to direct methods such as gel-electrophoresis and DNA 
sequencing, mutations can also be detected by in situ analysis. 

Altered levels of a PTCRE or NeRP of the present invention can also be detected in 
various tissues. For example, one or more genes having a PTCRE or a NeRP can be detected 
by assays used to detect levels of particular nucleic acid sequence, such as Southern 
hybridization, northern hybridization, and PCR. Alternatively, assays can be used to detect 
levels of a reporter polypeptide regulated by a PTCRE or a NeRP or of a polypeptide encoded 
by a gene having a PTCRE or a NeRP. Such assays are well known to those of skill in the art 
and include radioimmunoassays, competitive binding assays, western blot analysis, and 
ELISA assays. A sample from a subject, such as blood or a tissue biopsy derived from a 
host, may be the material on which these assays are conducted. 

Having now generally described the invention, the same will be more readily 
understood through reference to the following examples that are provided by way of 
illustration, and are not intended to be limiting of the present invention, unless specified. 

Each periodical, patent, and other document or reference cited herein is herein 
incorporated by reference in its entirety. 
Examples 

Example 1. Identification of compounds that specifically inhibit VEGF expression 
post-transcriptionally. 

A monocistronic reporter construct (pLuc/vegf5'+3'UTR) is under the transcriptional 
control of the CMV promoter and contains a VEGF IRES driving the luciferase reporter, 
which nucleic acid sequences are both upstream of a VEGF 3'-UTR. Stable cell lines are 
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generated by transfecting 293 cells with the pLuc/vegf5'+3'UTR. A stable cell line is 
cultured under hygromycin B selection to create clonal cell lines consistent with protocols 
well known in the art. After two weeks of selection, clonal cell lines are screened for 
luciferase activity. The luciferase activity of several clonal cell lines (hereafter "clones") are 
compared and normalized against total protein content. Clones are maintained under 
hygromycin B selection for more than three months with intermittent monitoring of luciferase 
activity. Clones are stable and maintain a high level of luciferase expression. Many clones, 
for example, about twenty, may be compared to each other with respect to luciferase activity. 
In comparison to clones B9, D3, and H6, clone B9 exhibits the highest level of luciferase 
activity. In addition, semi-quantitative PCR analysis is performed, and the results indicate 
that multiple copies of the reporter are integrated per cell. Particular parameters for clones 
are studied prior to selection for use in post-transcriptional, high-throughput screening 
(PTHTS). Relevant parameters for PTHTS include, but are not limited to, cell number, 
incubation time, DMSO concentration, and volume of substrate. 

Chemical libraries in excess of 150,000 compounds are screened by PTHTS with a 
clone containing the monocistronic reporter construct, pLuc/vegf5'+3'UTR. Screens are 
performed in duplicate with each molecule at a single concentration of 7.5 uM. Bright-Glow 
(Promega Co., Madison, WI) is used as a substrate to measure firefly luciferase activity. 
Active compounds are identified by reporting the average percent inhibition of the duplicate 
runs followed by rejecting those compounds that did not provide satisfactory reproducibility. 
The average percent inhibition of compounds that provide satisfactory reproducibility is 
within a range of about 10%, about 25% or about 35% in duplicate runs. Data is analyzed as 
a normal distribution, which is apparent from graphical and statistical analysis of skewness 
and kurtosis. Hits are then reported at about a 99% confidence level, usually representing a 
selection of 3 standard deviations from the mean, or a hit lower limit of observed inhibition 
about equal to 50%. These selection criteria result in a hit rate of about 1%. 

Certain compounds that are identified through the PTHTS-screening tier by screening 
with clone B9 modulate hypoxia-inducible endogenous VEGF expression. Endogenous 
VEGF protein levels are monitored by an ELISA assay (R&D Systems, Minneapolis, MN). 
HeLa cells are used to evaluate hypoxia-inducible expression. HeLa cells demonstrate about 
a three- to five-fold hypoxia-inducible window as compared to normoxic conditions (about 
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1000 - about 1500 pg/ml under hypoxia compared to about 200 - about 400 pg/ml under 
normoxia). Cells are cultured overnight under hypoxic conditions (about 1 % 0 2 , about 5% 
CO2, and balanced with nitrogen) in the presence or absence of compounds. The conditioned 
media is assayed by ELISA. The concentration of VEGF is calculated from the standard 
ELISA curve of each assay. The assays are performed in duplicate at a compound 
concentration of about 7 uM. A threshold of about 50% inhibition for a compound is selected 
as a criterion for further investigation. Further evaluation of about 100 to about 150 
compounds is conducted from about 700 to about 800 initial PTHTS hits. The activity of the 
identified compounds is confirmed by repeating the experiments described above. The 
identified compounds are then acquired as dry powders and analyzed further. The purity and 
molecular weight of the identified compounds are confirmed by LC-MS. 

A dose-response analysis is performed using an ELISA assay and using conditions 
essentially as described above. A series of seven different concentrations are analyzed. In 
parallel, a dose-response cytotoxicity assay is performed under the same conditions as the 
ELISA to ensure that the inhibition of VEGF expression is not due to cytotoxicity as 
measured by CellTiter-Glo® (Promega, Inc., Madison, WI). Dose-response curves are plotted 
using percentage inliibition versus concentration of the compound. 

For each compound, the maximal inhibition is set as 100% and the minimal inhibition 
is set as 0% to generate EC50 and CC50 values. A compound from PTHTS shows a sigmoidal 
curve over a compound concentration range from about lC'riM to about 10 4 nM when plotted 
against the percent inhibition of VEGF expression on the y-axis {see Figure 2). The same 
compound from PTHTS shows a convex curve over the same compound concentration range 
plotted against the percent of cytotoxicity. The ELISA EC50 (50% inhibition of VEGF 
expression) for this particular compound is about 7 nM, while its CC50 (50% cytotoxicity) is 
greater than about 2000 nM. Subsets of compounds that show similar efficacy/cytotoxicity 
windows are also identified. 

The B9 cell line harbors the firefly luciferase reporter driven by the CMV promoter 
and flanked by the 5'- and 3'-UTRs of VEGF transcripts. Use of the B9 cell line with the 
PTHTS identifies compounds that specifically target the function of VEGF UTRs to 
modulate expression. Cell line B12 harbors the luciferase reporter in the absence of operably 
linked VEGF UTRs. Compounds that inhibit luciferase activity in both the B9 and B12 cell 
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lines are general transcription or general translation inhibitors or luciferase enzyme inhibitors. 
Several UTR specific compounds are identified in experiments with PTHTS identified 
compounds as described above. The dose-response curves of an identified compound show a 
concave curve in B9 cells and a sigmoidal curve in B12 cells when the percent luciferase 
inhibition of each is plotted over a compound concentration range from about lO^nM to 
about 10 4 nM on the x-axis (see Figure 3). The difference between the two cell lines (B9 and 
B12) shows that inhibition of VEGF production by this compound is through the VEGF 
UTRs, i.e., by a post-transcriptional control mechanism. A control experiment is performed 
with a general translation inhibitor, puromycin. Puromycin treatment does not change the 
difference of inhibition in luciferase expression in these two cell lines. 
Example 2. Characteristics of UTR-specific VEGF inhibitors 

All identified compounds are re-synthesized and shown by LC/MS and combustion 
analysis to be greater than 95% pure. Subsequently, the re-synthesized compounds are tested 
in the dose-response VEGF ELISA and luciferase assays that are used to initially assess UTR 
specificity. All identified compounds retain UTR specificity and are bona fide inhibitors of 
VEGF expression. 

PTHTS using B9 cells identified compounds that specifically inhibit hypoxia 
inducible VEGF expression for the treatment of ocular neovascular diseases. Compounds 
that target multiple angiogenesis factors (including VEGF) for the treatment of cancers are 
also identifiable. Several targets are used for these purposes, including TNF-a, FGF-2, G- 
CSF, IGF-1, PDGF, and HIF-la. 

ELISA assays analyze levels of expression of these factors using commercially 
available kits from R&D Systems (Minneapolis, MN). UTR-specific PTHTS identified 
compounds are tested for their ability to inhibit the expression of a subset of these proteins, 
including G-CSF, TNFa, FGF-2, and IGF-1. Identified compounds that are very potent 
inhibitors of VEGF production as assayed in HeLa cells have EC50 values ranging from low 
nM to high nM. Treatment with a general translation inhibitor (puromycin) results in similar 
inhibition for all these cytokines, with EC50 values ranging from about 0.2 to about 2 uM. 

Lead compounds are further characterized and optimized. Analogs are synthesized and 
identified compounds exhibit excellent potency in the VEGF ELISA assay (EC50 values 
ranging from 0.5 nM to 50 nM). In another embodiment, an analog exhibits low nM potency. 
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In an additional embodiment, several analogs are synthesized and a subset of identified 
compounds are very active (EC 50 values ranging from 1 to 50 nM) in the VEGF ELISA 
assay. Activity of a very potent analog is improved about 500-fold compared to its parent 
(EC50 of 1 nM vs. 500 nM). Further characterization and optimization for selectivity and 
pharmaceutical properties (ADMET) of the most active compounds will develop a drug 
candidate(s) for clinical trials. 

Example 3. Identified compounds are active as inhibitors of hypoxia-inducible VEGF 
production in retinal pigment epithelial cells and macrophage cells. 

PTHTS identified compounds are VEGF-specific inhibitors for the treatment of ocular 
neovascular disorders. The effect of the identified compounds on retinal pigment epithelial 
cells and macrophage cells are tested in two cell lines: ARPE-19, a human retinal pigment 
epithelial cell line, and RAW264, a mouse macrophage cell line. Both cell lines produce high 
levels of VEGF under hypoxic conditions. A subset of identified compounds is active in these 
two cell lines. Compound 1 inhibits VEGF production in both macrophage (a non-limiting 
example of which is RAW264.7) and retinal pigment epithelial cells (a non-limiting example 
of which is ARPE-19). In selectivity studies, as shown in Table 2, Compound 1 specifically 
inhibits VEGF expression relative to that of other factors (FGF-2, IGF-1 , GCSF, TNFa). 



Table 2: Selectivity studies 
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Example 4. Identified compounds are active in inhibition of VEGF expression and tumor 
growth in vivo. 
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A pharmacodynamic model assesses intratumor VEGF levels and selects compounds 
for in vivo efficacy. Preliminary data demonstrates that several of our compounds effectively 
inhibit VEGF production in tumor tissues (see Figure 4A). Briefly, HT1080 cells (a human 
fibrosarcoma cell line) are implanted subcutaneously in nude mice. After seven days, mice 
are administrated compounds orally at 20mg/kg/day for two weeks. The tumors are then 
excised from the mice and homogenized in Tris-HCl buffer containing proteinase inhibitors 
(Moulder, S.L., et al., Cancer Res. 61(24):8887-95, 2001). Intratumor VEGF levels are 
subsequently measured using a human VEGF ELISA kit (R&D System, Minneapolis, MN). 
Protein concentrations of the homogenates are measured with a Bio-Rad™ Protein assay kit 
and intratumor VEGF levels are normalized to the protein concentrations. Treatment with the 
identified compounds significantly reduces intratumor VEGF protein levels compared to the 
vehicle control. In addition, treatment with the identified compound for two weeks inhibits 
tumor growth as compared to the vehicle-treated control groups (see Figure 4B). 
Example 5. Mapping of Functional Domains in a 5 ' VEGF UTR. 

VEGF-5UTR1 and 5UTR2 are amplified from human genomic DNA. The full-length 
5' UTR is generated by ligation of the two fragments (see Figure 5). P21uc/VEGF5UTR-FL 
is generated by inserting the full-length 5' UTR into a dicistronic plasmid (p21uc) between 
Sal Zand Smal I sites. Other vectors are derived from p21uc/VEGF5UTR-FL by deleting 
relevant sequences. All these plasmids are tested in 293 cells by transient transfection. 
Figure 6 shows the relative fire-fly luciferase activity (normalized against Renilla luciferase) 
for each of the VEGF 5' UTR fragments. Similar results are obtained from repeating such 
experiments. 
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WHAT IS CLAIMED: 

1 . A nucleic acid construct comprising a nucleic acid sequence encoding a reporter 
polypeptide, wherein said nucleic acid sequence encoding a reporter polypeptide is 
operably linked to a NeRP, said NeRP is operably linked to a PTCRE, said PTCRE is 
not SEQ ID NO: 3, and expression of said reporter polypeptide is capable of being 
modulated relative to in an absence of said NeRP. 

2. The nucleic acid construct according to claim 1, wherein a NeRPl (SEQ ID NO: 4) 
is located within said nucleic acid sequence encoding a reporter polypeptide. 

3. The nucleic acid construct according to claim 1, wherein a NeRPl (SEQ ID NO: 4) 
is located downstream of said nucleic acid sequence encoding a reporter polypeptide. 

4. The nucleic acid construct according to claim 1, wherein a NeRPl (SEQ ID NO: 4) 
is located within an intron and said intron is located within said nucleic acid sequence 
encoding a reporter polypeptide. 

5. A nucleic acid molecule comprising a nucleic acid sequence encoding a reporter 
polypeptide and a VEGF 5' UTR nucleic acid sequence in an absence of SEQ ID NO: 
4. 

6. The nucleic acid molecule according to claim 5, wherein said VEGF 5' UTR in an 
absence of SEQ ID NO: 4 contains an internal ribosome entry site ("IRES"). 

7. The nucleic acid molecule according to claim 5, further comprising a high-level 
mammalian expression vector capable of integrating randomly into the genome. 

8. The nucleic acid molecule according to claim 5, further comprising a high-level 
mammalian expression vector capable of integrating site-selectively into the genome. 

9. The nucleic acid molecule according to claim 5, further comprising an episomal 
mammalian expression vector. 

10. The nucleic acid molecule according to claim 5, wherein said nucleic acid sequence 
encoding a reporter polypeptide contains an intron. 

1 1 . The nucleic acid molecule according to claim 5, wherein said VEGF 5' UTR in an 
absence of SEQ ID NO: 4 contains an intron. 

12. A nucleic acid molecule comprising a nucleic acid sequence encoding a reporter 
polypeptide operably linked to a VEGF 5' UTR in an absence of SEQ ID NO: 4. 
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13. The nucleic acid molecule according to claim 12, wherein said nucleic acid 
sequence encoding a reporter polypeptide contains an intron. 

14. A nucleic acid molecule comprising a nucleic acid sequence encoding a reporter 
polypeptide, wherein said nucleic acid sequence encoding a reporter polypeptide is 
operably linked downstream of a UTR containing a NeRP, and said UTR is not 
operably upstream of SEQ ID NO: 3. 

15. A heterogeneous population of nucleic acid molecules, wherein said heterogeneous 
population comprises a reporter nucleic acid sequence, and said nucleic acid sequence 
encoding a reporter polypeptide is operably linked to a VEGF 5' UTR in an absence of 
NeRPl (SEQ ID NO: 4). 

16. The heterogeneous population of nucleic acid molecules according to claim 15, 
wherein said heterogeneous population is isolated from a stable cell line. 

17. The heterogeneous population of nucleic acid molecules according to claim 15, 
wherein said population is produced in vitro. 

18. The heterogeneous population of nucleic acid molecules according to claim 15, 
wherein said population is used to produce polypeptides. 

19. The heterogeneous population of nucleic acid molecules according to claim 1 5, 
wherein said heterogeneous population is selected to exclude molecules with a 5' cap. 

20. The heterogeneous population of nucleic acid molecules according to claim 1 5, 
wherein said heterogeneous population is poly-adenylated. 

21. The heterogeneous population of nucleic acid molecules according to claim 15, 
wherein said heterogeneous population is not poly-adenylated. 

22. A substantially purified nucleic acid molecule comprising between 95% and 99% 
sequence identity with a nucleic acid molecule of SEQ ID NO: 3, a fragment thereof, or 
a complement of either. 

23. A substantially purified nucleic acid molecule consisting of SEQ ID NO: 3, a 
fragment thereof, or a complement of either. 

24. A substantially purified nucleic acid molecule consisting of a first nucleic acid 
sequence linked to a heterologous nucleic acid sequence encoding a polypeptide, 
wherein said first nucleic acid sequence is selected from the group consisting of SEQ 
ID NO: 3, a fragment thereof, and a complement of either. 
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25. A substantially purified nucleic acid molecule comprising between 95% and 99% 
sequence identity with a nucleic acid molecule of SEQ ID NO: 4, a fragment thereof, or 
a complement of either. 

26. A substantially purified nucleic acid molecule of a nucleic acid sequence selected 
from a group consisting of SEQ ID NO: 4, a fragment thereof, and a complement of 
either. 

27. A substantially purified nucleic acid molecule consisting of a first nucleic acid 
sequence linked to a heterologous nucleic acid sequence encoding a polypeptide, 
wherein said first nucleic acid sequence is selected from the group consisting of SEQ 
ID NO: 4, a fragment thereof, and a complement of either. 

28. A method of making a nucleic acid construct to screen for a compound comprising: 

a) providing a main ORF downstream of a promoter in said nucleic acid construct; 

b) operably linking a VEGF 5 ' UTR in an absence of SEQ ID NO : 4 upstream of 

said main ORF; and 

c) operably linking a VEGF 3 ' UTR downstream of said main ORF. 

29. The method according to claim 28, wherein said main ORF encodes a reporter 
polypeptide. 

30. A method of screening in vivo for a compound that modulates UTR-dependent 
expression comprising: 

a) providing a cell having a nucleic acid molecule comprising a promoter upstream 
from a VEGF 5' UTR in an absence of SEQ ID NO: 4, wherein said VEGF 5' UTR in 
an absence of SEQ ID NO: 4 is upstream from a nucleic acid sequence encoding a 
reporter polypeptide, and said nucleic acid sequence encoding a reporter polypeptide is 
upstream from a VEGF 3' UTR; 

b) contacting said cell with a compound; 

c) producing a nucleic acid molecule that contains a nucleic acid sequence encoding a 
reporter polypeptide and does not contain SEQ ID NO: 4; and 

d) detecting said reporter polypeptide. 

3 1 . The method according to claim 30, wherein said nucleic acid sequence encoding a 
reporter polypeptide is VEGF. 
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32. A method of screening in vitro for a compound that modulates UTR-affected 
expression comprising: 

a) providing an in vitro translation system; 

b) contacting said in vitro translation system with a compound and a nucleic acid 
molecule comprising a VEGF 5' UTR in an absence of SEQ ID NO: 4, wherein said 
VEGF 5' UTR in an absence of SEQ ID NO: 4 is upstream from a nucleic acid 
sequence encoding a reporter polypeptide and said nucleic acid sequence encoding a 
reporter polypeptide is upstream from a VEGF 3' UTR; and 

c) detecting said reporter polypeptide in vitro. 

33. A method of expressing a nucleic acid molecule in a cell comprising: 

a) providing a nucleic acid molecule to a cell, wherein said nucleic acid molecule 
comprises a nucleic acid sequence encoding a reporter polypeptide flanked by VEGF 
UTRs in an absence of SEQ ID NO: 4; and 

b) detecting said reporter polypeptide. 

34. The method according to claim 33, wherein said nucleic acid molecule is produced 
by in vitro transcription. 

35 . The method according to claim 33, wherein said nucleic acid molecule is a 
synthetically produced RNA molecule. 

36. A method of screening for a compound that modulates protein expression through a 
main ORF-independent, UTR-affected mechanism comprising: 

a) growing a stable cell line having a reporter gene operably linked to a VEGF 5' UTR 
in an absence of SEQ ID NO: 4; 

b) comparing said stable cell line in a presence of a compound relative to said stable 
cell line in an absence of said compound; and 

c) selecting for said compound that modulates protein expression through a main ORF- 
independent, UTR-affected mechanism. 

37. A method of screening for a compound that modulates protein expression through a 
main ORF-independent, UTR-affected mechanism comprising: 

a) growing a stable cell line having a main ORF operably linked to a VEGF 5 ' UTR in 
an absence of SEQ ID NO: 4; 

b) comparing said stable cell line in the presence of a compound relative to said stable 



62 



WO 2005/118857 



PCT/US2004/020751 



cell line in the absence of said compound; and 

c) selecting for said compound that modulates protein expression through a main ORF- 
independent, UTR-affected mechanism. 

38. A method of screening for a compound that modulates protein expression through a 
VEGF-independent, UTR-affected mechanism comprising: 

a) substituting in vivo a VEGF gene with a reporter gene, wherein a UTR consisting of 
SEQ ID NO: 3 is operably linked to said reporter gene, and said substitution occurs in a 
differentiated cell; 

b) growing said differentiated cell; and 

c) selecting for said compound that modulates protein expression of said reporter gene 
through a main ORF-independent, UTR-affected mechanism. 

39. A method of screening for a compound that modulates protein expression through a 
main ORF-independent, UTR-affected mechanism comprising: 

a) substituting in vivo a main ORF with a reporter gene, wherein a 5' UTR is operably 
linked to said reporter gene and consists of SEQ ID NO: 3, and said substitution occurs 
in a differentiated cell; 

b) growing said differentiated cell; and 

c) selecting for said compound that modulates protein expression of said reporter gene 
through a main ORF-independent, UTR-affected mechanism. 

40. A method of screening for a compound that modulates protein expression through a 
UTR-affected mechanism comprising: 

a) providing a stable cell line having a reporter gene operably linked to a VEGF 5' 
UTR in an absence of SEQ ID NO: 4, wherein said stable cell line mimics post- 
transcriptional regulation of a VEGF gene found in vivo in presence of said compound; 

b) maintaining said stable cell line; and 

c) selecting for said compound that modulates protein expression of said reporter gene 
through a UTR-affected mechanism. 

41 . The method according to claim 40, wherein said stable cell line is a transformed 
cell. 

42. A method of screening for a compound that modulates protein expression through a 
UTR-affected mechanism comprising: 
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a) providing a stable cell line having a main ORF encoding a reporter polypeptide 
operably linked to a VEGF 5' UTR in an absence of SEQ ID NO: 4, wherein said stable 
cell line mimics post-transcriptional regulation of a VEGF gene found in vivo in a 
presence of a compound; 

b) maintaining said stable cell line; and 

c) selecting for said compound that modulates protein expression of said main ORF 
through a UTR-affected mechanism. 

A method of screening for a compound that modulates protein expression through a 
UTR-affected mechanism mediating the effect of a NeRP comprising: 

a) growing a stable cell line having a reporter gene operably linked to a 5' VEGF UTR 
in an absence of a NeRP 1 (SEQ ID NO: 4); 

b) comparing said stable cell line in a presence of a compound relative to in an absence 
of said compound, wherein said compound does not modulate UTR-dependent 
expression if said 5' VEGF UTR in an absence of a NeRP 1 (SEQ ID NO: 4) is operably 
linked to a reporter gene; and 

c) selecting for said compound that modulates protein expression of said reporter gene 
through a UTR-affected mechanism mediating the effect of a NeRP. 

The method according to claim 43, wherein said c) selecting for said compound 
comprises comparing said stable cell line in the presence of a compound relative to in 
the absence of said compound, wherein said compound modulates UTR-dependent 
expression if said reporter gene is operably linked to said 5' VEGF UTR. 

A method of screening for a compound that modulates protein expression through a 
UTR-affected mechanism mediating the effect of a NeRP comprising: 

a) growing a stable cell line having a main ORF encoding a reporter polypeptide 
operably linked to a 5' VEGF UTR in an absence of a NeRP 1 (SEQ ID NO: 4); 

b) comparing said stable cell line in the presence of a compound relative to in the 
absence of said compound, wherein said compound does not modulate UTR-dependent 
expression if said 5' VEGF UTR in an absence of a NeRP 1 (SEQ ID NO: 4) is operably 
linked to a main ORF; and 

c) selecting for said compound that modulates protein expression of said main ORF 
through a UTR-affected mechanism mediating the effect of a NeRP. 
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46. A method of screening for a compound that modulates protein expression through a 
UTR-affected mechanism mediating the effect of a NeRP comprising: 

a) growing a stable cell line having a reporter gene operably linked to a UTR having a 
NeRPl (SEQ ID NO: 4); 

b) comparing said stable cell line in a presence of a compound relative to in an absence 
of said compound, wherein said compound modulates UTR-dependent expression if a 
NeRPl (SEQ ID NO: 4) is operably linked to a reporter gene; and 

c) selecting for said compound that modulates protein expression of said reporter gene 
through a UTR-affected mechanism mediating the effect of a NeRP. 

47. A method of screening for a compound that modulates protein expression through a 
UTR-affected mechanism mediating the effect of a NeRP comprising: 

a) growing a stable cell line having a main ORF encoding a reporter polypeptide 
operably linked to a UTR having a NeRPl (SEQ ID NO: 4); 

b) comparing said stable cell line in a presence of a compound relative to in an absence 
of said compound, wherein said compound modulates UTR-dependent expression if a 
NeRPl (SEQ ID NO: 4) is operably linked to a main ORF encoding a reporter 
polypeptide; and 

c) selecting for said compound that modulates protein expression of said main ORF 
encoding a reporter polypeptide through a UTR-affected mechanism mediating the 
effect of said NeRP. 
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SEQUENCE LISTING 

<110> PTC Therapeutics 
Cao, Liangxian 
Trotta, Christopher Robert 

<120> Methods and Agents for Screening for Compounds Capable of 
Modulating VEGF Expression 

<130> 19025.021 

<140> TBA 

<141> 2004-05-28 

<150> 10/851,074 
<151> 2004-05-25 

<160> 26 

<170> Patentln version 3.2 

<210> 1 

<211> 1038 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 1 



tcgcggaggc 


ttggggcagc 


cgggtagctc 


ggaggtcgtg 


gcgctggggg 


ctagcaccag 


60 


cgctctgtcg 


ggaggcgcag 


cggttaggtg 


gaccggtcag 


cggactcacc 


ggccagggcg 


120 


ctcggtgctg 


gaatttgata 


ttcattgatc 


cgggttttat 


ccctcttctt 


ttttcttaaa 


180 


catttttttt 


taaaactgta 


ttgtttctcg 


ttttaattta 


tttttgcttg 


ccattcccca 


240 


cttgaatcgg 


gccgacggct 


tggggagatt 


gctctacttc 


cccaaatcac 


tgtggatttt 


300 


ggaaaccagc 


agaaagagga 


aagaggtagc 


aagagctcca 


gagagaagtc 


gaggaagaga 


360 


gagacggggt 


cagagagagc 


gcgcgggcgt 


gcgagcagcg 


aaagcgacag 


gggcaaagtg 


420 


agtgacctgc 


ttttgggggt 


gaccgccgga 


gcgcggcgtg 


agccctcccc 


cttgggatcc 


480 


cgcagctgac 


cagtcgcgct 


gacggacaga 


cagacagaca 


ccgcccccag 


ccccagctac 


540 


cacctcctcc 


ccggccggcg 


gcggacagtg 


gacgcggcgg 


cgagccgcgg 


gcaggggccg 


600 


gagcccgcgc 


ccggaggcgg 


ggtggagggg 


gtcggggctc 


gcggcgtcgc 


actgaaactt 


660 


ttcgtccaac 


ttctgggctg 


ttctcgcttc 


ggaggagccg 


tggtccgcgc 


gggggaagcc 


720 


gagccgagcg 


gagccgcgag 


aagtgctagc 


tcgggccggg 


aggagccgca 


gccggaggag 


780 


ggggaggagg 


aagaagagaa 


ggaagaggag 


agggggccgc 


agtggcgact 


cggcgctcgg 


840 


aagccgggct 


catggacggg 


tgaggcggcg 


gtgtgcgcag 


acagtgctcc 


agccgcgcgc 


900 
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gctccccagg ccctggcccg ggcctcgggc cggggaggaa gagtagctcg ccgaggcgcc 960 
gaggagagcg ggccgcccca cagcccgagc cggagaggga gcgcgagccg cgccggcccc 1020 
ggtcgggcct ccgaaacc 1038 

<210> 2 

<211> 576 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 2 

atgaactttc tgctgtcttg ggtgcattgg agccttgcct tgctgctcta cctccaccat 60 

gccaagtggt cccaggctgc acccatggca gaaggaggag ggcagaatca tcacgaagtg 120 

gtgaagttca tggatgjtcta tcagcgcagc tactgccatc caatcgagac cctggtggac 180 

atcttccagg agtaccctga tgagatcgag tacatcttca agccatcctg tgtgcccctg 240 

atgcgatgcg ggggctgctg caatgacgag ggcctggagt gtgtgcccac tgaggagtcc 300 

aacatcacca tgcagattat gcggatcaaa cctcaccaag gccagcacat aggagagatg 360 

agcttcctac agcacaacaa atgtgaatgc agaccaaaga aagatagagc aagacaagaa 420 

aatccctgtg ggccttgctc agagcggaga aagcatttgt ttgtacaaga tccgcagacg 480 

tgtaaatgtt cctgcaaaaa cacagactcg cgttgcaagg cgaggcagct tgagttaaac 540 

gaacgtactt gcagatgtga caagccgagg cggtga 57 6 

<210> 3 

<211> 702 

•<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 3 

tccagagaga agtcgaggaa gagagagacg gggtcagaga gagcgcgcgg gcgtgcgagc 60 

agcgaaagcg acaggggcaa agtgagtgac ctgcttttgg gggtgaccgc cggagcgcgg 120 

cgtgagccct cccccttggg atcccgcagc tgaccagtcg cgctgacgga cagacagaca 180 

gacaccgccc ccagccccag ctaccacctc ctccccggcc ggcggcggac agtggacgcg 240 

gcggcgagcc gcgggcaggg gccggagccc gcgcccggag gcggggtgga gggggtcggg 300 

gctcgcggcg tcgcactgaa acttttcgtc caacttctgg gctgttctcg cttcggagga 360 

gccgtggtcc gcgcggggga agccgagccg agcggagccg cgagaagtgc tagctcgggc 420 
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cgggaggagc 


cgcagccgga 


ggagggggag 


gaggaagaag 


agaaggaaga 


ggagaggggg 


480 


ccgcagtggc 


gactcggcgc 


tcggaagccg ggctcatgga 


cgggtgaggc 


ggcggtgtgc 


540 


gcagacagtg 


ctccagccgc 


gcgcgctccc 


caggccctgg 


cccgggcctc 


gggccgggga 


600 


ggaagagtag 


ctcgccgagg 


cgccgaggag 


agcgggccgc 


cccacagccc 


gagccggaga 


660 


gggagcgcga 


gccgcgccgg 


ccccggtcgg 


gcctccgaaa 


cc 




702 



<210> 4 

<211> 336 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 4 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 120 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 24 0 

cttgaatcgg gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt 300 

ggaaaccagc agaaagagga aagaggtagc aagagc 336 

<210> 5 

<211> 485 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 5 

gccggcggcg gacagtggac gcggcggcga gccgcgggca ggggccggag cccgcgcccg 60 

gaggcggggt ggagggggtc ggggctcgcg gcgtcgcact gaaacttttc gtccaacttc 120 

tgggctgttc tcgcttcgga ggagccgtgg tccgcgcggg ggaagccgag ccgagcggag 180 

ccgcgagaag tgctagctcg ggccgggagg agccgcagcc ggaggagggg gaggaggaag 240 

aagagaagga agaggagagg gggccgcagt ggcgactcgg cgctcggaag ccgggctcat 300 

ggacgggtga ggcggcggtg tgcgcagaca gtgctccagc cgcgcgcgct ccccaggccc 360 

tggcccgggc ctcgggccgg ggaggaagag tagctcgccg aggcgccgag gagagcgggc 420 

cgccccacag cccgagccgg agagggagcg cgagccgcgc cggccccggt cgggcctccg 4 80 

aaacc 435 
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<210> 6 

<211> 556 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 6 



cagctgacca 


gtcgcgctga 


cggacagaca 


gacagacacc 


gcccccagcc 


ccagctacca 


60 


cctcctcccc 


ggccggcggc 


ggacagtgga 


cgcggcggcg 


agccgcgggc 


aggggccgga 


120 


gcccgcgccc 


ggaggcgggg 


tggagggggt 


cggggctcgc 


ggcgtcgcac 


tgaaactttt 


180 


cgtccaactt 


ctgggctgtt 


ctcgcttcgg 


aggagccgtg 


gtccgcgcgg 


gggaagccga 


240 


gccgagcgga 


gccgcgagaa 


gtgctagctc 


gggccgggag 


gagccgcagc 


cggaggaggg 


300 


ggaggaggaa 


gaagagaagg 


aagaggagag 


ggggccgcag 


tggcgactcg gcgctcggaa 


360 


gccgggctca 


tggacgggtg 


aggcggcggt 


gtgcgcagac 


agtgctccag 


ccgcgcgcgc 


420 


tccccaggcc 


ctggcccggg 


cctcgggccg 


gggaggaaga 


gtagctcgcc 


gaggcgccga 


480 


ggagagcggg 


ccgccccaca 


gcccgagccg 


gagagggagc 


gcgagccgcg 


ccggccccgg 


. 540 


tcgggcctcc 


gaaacc 










556 



<210> 7 

<211> 294 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 7 

gctagctcgg gccgggagga gccgcagccg gaggaggggg aggaggaaga agagaaggaa 60 

gaggagaggg ggccgcagtg gcgactcggc gctcggaagc cgggctcatg gacgggtgag 120 

gcggcggtgt gcgcagacag tgctccagcc gcgcgcgctc cccaggccct ggcccgggcc 180 

tcgggccggg gaggaagagt agctcgccga ggcgccgagg agagcgggcc gccccacagc 240 

ccgagccgga gagggagcgc gagccgcgcc ggccccggtc gggcctccga aacc 2 94 



<210> 8 

<211> 194 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 8 
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cgggctcatg gacgggtgag gcggcggtgt gcgcagacag tgctccagcc gcgcgcgctc 60 

cccaggccct ggcccgggcc tcgggccggg gaggaagagt agctcgccga ggcgccgagg 120 

agagcgggcc gccccacagc ccgagccgga gagggagcgc gagccgcgcc ggccccggtc 180 

gggcctccga aacc 294 

<210> 9 

<211> 476 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 9 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 120 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 240 

cttgaatcgg gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt 300 

ggaaaccagc agaaagagga aagaggtagc aagagctcca gagagaagtc gaggaagaga 360 

gagacggggt cagagagagc gcgcgggcgt gcgagcagcg aaagcgacag gggcaaagtg 420 

agtgacctgc ttttgggggt gaccgccgga gcgcggcgtg agccctcccc cttggg 47 6 

<210> 10 

<211> 554 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 10 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 120 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 240 

cttgaatcgg gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt 300 

ggaaaccagc agaaagagga aagaggtagc aagagctcca gagagaagtc gaggaagaga 360 

gagacggggt cagagagagc gcgcgggcgt gcgagcagcg aaagcgacag gggcaaagtg 420 

agtgacctgc ttttgggggt gaccgccgga gcgcggcgtg agccctcccc cttgggatcc 480 
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cgcagctgac cagtcgcgct gacggacaga cagacagaca ccgcccccag ccccagctac 



540 



cacctcctcc ccgg 



554 



<210> 11 

<211> 51 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 11 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg c 51 

<210> 12 

<211> 91 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic construct 



<210> 13 

<211> 335 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 13 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 120 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 240 

cttgaatcgg gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt 300 

ggaaaccagc agaaagagga aagaggtagc aagag 335 



<210> 14 

<211> 332 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 



<400> 12 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 



60 



cgctctgtcg ggaggcgcag cggttaggtg g 



91 
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<400> 14 

tcgcggaggc ttggggcagc cgggtagctc ggaggcgtgg cgctgggggc tagcaccagc 



60 



gctctgtcgg gaggcgcagc ggttaggtgg accggtcagc ggactcaccg gccagggcgc 120 
tcggtgctgg aatttgatat tcattgatcc gggttttatc cctcttcttt tttcttaaac 180 
attttttttt aaaactgttt gtttctcgtt ttaatttatt tttgcttgcc attccccact 240 



<210> 15 

<211> 331 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 15 

tcgcggaggc ttggggcagc cgggtagctc ggagtcgtgg cgctgggggc tagcaccagc 60 
gctctgtcgg gaggcgcagc ggttaggtgg accgtcagcg gactcaccgg ccagggcgct 120 

1 

cggtgctgga atttgatatt cattgatccg ggtttatccc tcttcttttt tcttaaacat 180 

ttttttttaa aactgtattg tttctcgttt taattatttt tgcttgccat tccccacttg 240 

aatcgggccg acggcttggg gagattgctc tactccccaa atcactgtgg attttggaaa 300 

ccagcagaaa gaggaaagag gtagcaagag c 331 



<210> 16 

<211> 330 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 16 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 120 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 240 

cttgaatcgg gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt 300 

ggaaaccagc agaaagagga aagaggtagc 330 



tgaatcgggc cgacggcttg gggagattgc tctacttccc caaatcactg tggattttgg 



300 



aaaccagcag aaagaggaaa gagagcaaga gc 



332 



<210> 17 
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<211> 329 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 17 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 
cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 
ctcggtgctg gaattttcat tgatccgggt tttatccctc ttcttttttc ttaaacattt 
ttttttaact gtattgtttc tcgttttaat ttatttttgc ttgccattcc ccacttgaat 
cgggccgacg gcttggggag attgctctac ttccccaaat cactgtggat tttggaaacc 
agcagaaaga ggaaagaggt agcaagagc 



<210> 18 

<211> 328 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 18 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 120 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 240 

cttgaatcgg gccgacggct tggctctact tccccaaatc actgtggatt ttggaaacca 300 

gcagaaagag gaaagaggta gcaagagc 328 



<210> 19 

<211> 326 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 19 

tcgcggaggc ttggggcagc ggaggtcgtg gcgctggggg ctagcaccag cgctctgtcg 60 
ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg ctcggtgctg 120 
gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa catttttttt 180 
taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca cttgaatcgg 240 

8 
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gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt ggaaaccagc 300 
agaaagagga aagaggtagc aagagc 326 



<210> 20 

<211> 326 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 20 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 120 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 240 

cttgaatcgg gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt 300. 
ggaaaccagc agaaagagga aagagg 32 6 



<210> 21 

<211> 316 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 21 

tcgcggaggc ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag 60 

cgctctgtcg ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg 120 

ctcggtgctg gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa 180 

catttttttt taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca 240 

cttgaatcgg gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt 300 

ggaaaccagc agaaag 316 



<210> 22 

<211> 306 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 22 

ttggggcagc cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag cgctctgtcg 60 
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ggaggcgcag cggttaggtg gaccggtcag cggactcacc ggccagggcg ctcggtgctg 120 

gaatttgata ttcattgatc cgggttttat ccctcttctt ttttcttaaa catttttttt 180 

taaaactgta ttgtttctcg ttttaattta tttttgcttg ccattcccca cttgaatcgg 240 

gccgacggct tggggagatt gctctacttc cccaaatcac tgtggatttt ggaaaccagc 300 

agaaag 306 



<210> 23 

<211> 296 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 23 

cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag cgctctgtcg ggaggcgcag 60 
cggttaggtg gaccggtcag cggactcacc ggccagggcg ctcggtgctg gaatttgata 120 
ttcattgatc cgggttttat ccctcttctt ttttcttaaa catttttttt taaaactgta 180 
ttgtttctcg ttttaattta tttttgcttg ccattcccca cttgaatcgg gccgacggct 2 40 
tggggagatt gctctacttc cccaaatcac tgtggatttt ggaaaccagc agaaag 296 



<210> 24 

<211> 286 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 
<400> 24 

cgggtagctc ggaggtcgtg gcgctggggg ctagcaccag cgctctgtcg ggaggcgcag 60 
cggttaggtg gaccggtcag cggactcacc ggccagggcg ctcggtgctg gaatttgata 120 
ccctcttctt ttttcttaaa catttttttt taaaactgta ttgtttctcg ttttaattta 180 
tttttgcttg ccattcccca cttgaatcgg gccgacggct gctctacttc cccaaatcac 240 
tgtggatttt ggaaaccagc agaaagagga aagaggtagc aagagc 28 6 



<210> 25 

<211> 276 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 25 
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gcgctggggg ctagcaccag cgctctgtcg 


ggaggcgcag 


cggttaggtg gaccggtcag 


60 


cggactcacc ggccagggcg ctcggtgctg 


gaatttgata 


ttcattgatc cgggttttat 


120 


ccctcttctt ttttcttaaa catttttttt 


taaaactgta 


ttgtttctcg ttttaattta 


180 


tttttgcttg ccattcccca cttgaatcgg 


gccgacggct 


tggggagatt gctctacttc 


240 


cccaaatcac tgtggatttt ggaaaccagc 


agaaag 




276 



<210> 26 

<211> 266 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic Construct 

<400> 26 



ggaggtcgtg 


gcgctggggg 


ctagcaccag 


cgctctgtcg ggaggcgcag 


cggttaggtg 


60 


gaccggtcag 


cggactcacc 


ggccagggcg 


ctcggtgctg gaatttgata 


ccctcttctt 


120 


ttttcttaaa 


catttttttt 


taaaactgta 


ttgtttctcg ccattcccca 


cttgaatcgg 


180 


gccgacggct 


tggggagatt 


gctctacttc 


cccaaatcac tgtggatttt 


ggaaaccagc 


240 


agaaagagga 


aagaggtagc 


aagagc 






266 
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